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ASSAYS FOR PROTEIN KINASES USING FLUORESCENT PROTEIN 

SUBSTRATES 

BACKGROUND OF THE INVENTION 
This invention relates to the field of enzymatic assays and, in particular, 
assays for protein kinase activity involving modified fluorescent proteins. 

Protein phosphorylation is one of the most important general mechanisms 
of cellular regulation. "Protein phosphorylation commonly occurs, on three major amino 
acids, tyrosine, serine or threonine, and changes in the phosphorylation state of these 
amino acids within proteins can regulate many aspects of cellular metabolism, regulation, 
growth and differentiation. Changes in the phosphorylation state of proteins, mediated 
through phosphorylation by kinases, or dephosphorylation by phosphatases, is a common 
mechanism through which cell surface signaling pathways transmit and integrate 
information into the nucleus. Given their key role in cellular regulation, it is not 
surprising that defects in protein kinases and phosphatases have been implicated in many 
disease states and conditions. For example, the over-expression of cellular tyrosine 
kinases such as the EGF or PDGF receptors, or the mutation of tyrosine kinases to 
produce constitutive^ active forms (oncogenes) occurs in many cancer cells. Drucker et 
al. (1996) Nature Medicine 2: 561-56. Protein tyrosine kinases are also implicated in 
inflammatory signals. Defective Thr/Ser kinase genes have been demonstrated to be 
implicated in several diseases such as myotonic dystrophy as well as cancer, and 
Alzheimer's disease (Sanpei et al. (1995) Biochem. Biophys. Res. Commun. 212: 341-6; 
Sperber et al (1995) Neurosci. Lett. 197: 149-153; Grammas et al (1995) Neurobiology of 
Aging 16: 563-569; Govoni et al. (1996) Ann. N.Y. Acad. Sci, 111: 332-337). 

The involvement of protein kinases and phosphatases in disease states 
makes them attractive targets for the therapeutic intervention of drugs, and in fact many 
clinically useful drugs act on protein kinases or phosphatases. Examples include 
cyclosporin A which is a potent immunosuppressant that binds to cyclophilin. This 
complex binds to the Ca/calmodul in-dependent protein phosphatase type 2B (calcineurin) 
inhibiting its activity, and hence the activation of T-cells. (Sigal and Dumont (1992), 
Schreiber and Crabtree (1992)). Inhibitors of protein kinase C are in clinical trails as 
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therapeutic agents for the treatment of cancer. {Clin. Cancer Res. (1995) 1:113-122) as 
are inhibitors of cyclin dependent kinase. (/. MoL Med. (1995) 73:(10):509-14.) 

The number of known kinases and phosphatases are growing rapidly as the 
influence of genomic programs to identify the molecular basis for diseases have increased 
5 in size and scope. These studies are likely to implicate many more kinase and 

phosphatase genes in the development and propagation of diseases in the future, thereby 
making them attractive targets for drug discovery. However current methods of 
measuring protein phosphorylation have many disadvantages which prevents or limits the 
ability to rapidly screen using miniaturized automated formats of many thousands of 

10 compounds. This is because many current methods rely on the incorporation and 

measurement of 32 P into the protein substrates of interest. In whole cells this necessitates 
the use of high levels of radioactivity to efficiently label the cellular ATP pool and to 
ensure that the target protein is efficiently labeled with radioactivity. After incubation 
with test drugs, the cells must be lysed and the protein of interest purified to determine 

15 its relative degree of phosphorylation. This method requires high numbers of cells, long 
preincubation times, careful manipulation and washing steps (to avoid artifactual 
phosphorylation or dephosphorylation), as well as a method of purification of the target 
protein. Furthermore, final radioactive incorporation into target proteins is usually very 
low, giving the assay poor sensitivity. Alternative assay methods, for example based on 

20 phosphorylation-specific antibodies using ELISA-type approaches, involve the difficulty 
. of producing antibodies that distinguish between phosphorylated and non-phosphorylated 
proteins, and the requirement for cell lysis, multiple incubation and washing stages which 
are time consuming, complex to automate and potentially susceptible to artifacts. 

Kinase assays based on purified enzymes require large amounts of purified 

25 kinases, high levels of radioactivity, and methods of purification of the substrate protein 
away from incorporated 32 P-labelled ATP. They also suffer from the disadvantage of 
lacking the physiological context of the cell, preventing a direct assessment of a drugs 
toxicity and ability to cross the cells plasma membrane. 

Fluorescent molecules are attractive as reporter molecules in many assay 

30 systems because of their high sensitivity and ease of quantification. Recently, fluorescent 
proteins have been the focus of much attention because they can be produced in vivo by 
biological systems, and can be used to trace intracellular events without the need to be 
introduced into the cell through microinjection or permeabilization. The green 
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fluorescent protein of Aequorea victoria is particularly interesting as a fluorescent 
indicator protein. A cDNA for the protein has been cloned. (D.C. Prasher et al., 
"Primary structure of the Aequorea victoria green-fluorescent protein," Gene (1992) 
111:229-33.) Not only can the primary amino acid sequence of the protein be expressed 

5 from the cDNA, but the expressed protein can fluoresce. This indicates that the protein 
can undergo the cyclization and oxidation believed to be necessary for fluorescence. The 
fluorescence of green fluorescent protein is generated from residues S65-Y66-G67. 

Fluorescent proteins have been used as markers of gene expression, tracers 
of cell lineage and as fusion tags to monitor protein localization within living cells. (M. 

10 Chalfie et al., "Green fluorescent protein as a marker for gene expression," Science 

263:802-805; A.B. Cubitt et al., "Understanding, improving and using green fluorescent 
proteins," TIBS 20, November 1995, pp. 448-455. U.S. patent 5,491,084, M. Chalfie 
and D. Prasher. Furthermore, mutant versions of green fluorescent protein have been 
identified that exhibit altered fluorescence characteristics, including altered excitation and 

15 emission maxima, as well as excitation and emission spectra of different shapes. (R. 
Heim et al., "Wavelength mutations and posttranslational autoxidation of green 
fluorescent protein," Proc. Natl. Acad. ScL USA, (1994) 91:12501-04; R. Heim et al., 
"Improved green fluorescence," Nature (1995) 373:663-665.) These properties add 
variety and utility to the arsenal of biologically based fluorescent indicators. 

20 There is a need for assays of protein phosphorylation that are simple, 

sensitive, non-invasive, applicable to living cells and tissues and that avoid the use of any 
radioactivity. 

SUMMARY OF THE INVENTION 
25 When fluorescent proteins are modified to incorporate a phosphorylation 

site recognized by a protein kinase, the fluorescent proteins not only can become 
phosphorylated by the protein kinase, but they also can exhibit different fluorescent 
characteristics in their un-phosphorylated and phosphorylated forms when irradiated with 
light having a wavelength within their excitation spectrum. This characteristic makes 
30 fluorescent, protein substrates particularly useful for assaying protein kinase activity in a 
sample. 

This invention provides methods for detennining whether a sample 
contains protein kinase activity. The methods involve contacting the sample with a 
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phosphate donor, usually ATP, and a fluorescent protein substrate of the invention; 
exciting the fluorescent protein substrate with light of an appropriate wavelength; and 
measuring the amount of a fluorescent property that differs in the un-phosphorylated state 
and phosphorylated state. An amount that is consistent with the presence of the 
5 fluorescent protein substrate in its phosphorylated state indicates the presence of protein 
kinase activity, and an amount that is consistent with the presence of the protein substrate 
in its un-phosphorylated state indicates the absence of protein kinase activity. 

One embodiment of the above method is for determining the amount of 
protein kinase activity in a sample. In this method, measuring the amount of a 

10 fluorescent property in the sample comprises measuring the amount at two or more time 
points after contacting the sample with a phosphate donor and a fluorescent protein 
substrate of the invention,, and determining the quantity of change or rate of change of 
the measured amount. The quantity or rate of change of the measured amount reflects 
the amount of protein kinase activity in the sample. 

15 In another aspect, the invention provides methods for determining whether 

a cell exhibits protein kinase activity. The methods involve the steps of providing a 
transfected host cell of the invention that produces a fluorescent protein substrate of the 
invention; exciting the protein substrate in the cell with light of an appropriate 
wavelength; and measuring the amount of a fluorescent property that differs in the un- 

20 phosphorylated and phosphorylated states. An amount that is consistent with the 

presence of the protein substrate in its phosphorylated state indicates the presence of 
protein kinase activity, and an amount that is consistent with the presence of the protein 
. substrate in its un-phosphorylated state indicates the absence of protein kinase activity or 
the presence of phosphatase activity. 

25 In another aspect, the invention provides methods for determining the 

amount of activity of a protein kinase in one or more cells from an organism. The 
methods involve providing a transfected host cell comprising a recombinant nucleic acid 
molecule comprising expression control sequences operatively linked to a nucleic acid 
sequence coding for the expression of a fluorescent protein substrate of the invention, the 

30 cell expressing the fluorescent protein substrate; exciting the protein substrate in the cell 
with light; and measuring the amount of a fluorescent property that differs in the un- 
phosphorylated and phosphorylated states at two or more time points after contacting the 
sample with a phosphate donor and a fluorescent protein substrate, and determining the 
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quantity or rate of change of the measured amount. The quantity or rate of change of 
the measured amount reflects the amount of protein kinase activity in the sample. 

This invention also provides screening methods for determining whether a 
compound alters the activity of a protein kinase. The methods involve contacting a 
5 sample containing a known amount of protein kinase activity with the compound, a 
. phosphate donor for the protein kinase and . a fluorescent protein substrate of the 

invention; exciting the protein substrate; measuring the amount of protein kinase activity 
in the sample as a function of the quantity or rate of change of a fluorescent property that 
differs in the un-phosphorylated and phosphorylated states; and comparing the amount of " 

10 activity in the sample with a standard activity for the same amount of the protein kinase. 
A difference between the amount of protein kinase activity in the sample and the standard 
activity indicates that the compound alters the activity of the protein kinase. 

Another aspect of the drug screening methods involve determining whether 
a compound alters the protein kinase activity in a cell. The methods involve providing 

15 first and second transfected host cells exhibiting protein kinase activity and expressing a 
fluorescent protein substrate of the invention; contacting the first cell with an amount of 
the compound; contacting the second cell with a different amount of the compound; 
exciting the protein substrate in the first and second cells; measuring the amount of 
protein kinase activity as a function of the quantity of change or rate of change of a 

20 fluorescent property that differs in the un-phosphorylated and phosphorylated states in the 
first and second cells; and comparing the amount in the first and second cells. A 
difference in the amount indicates that the compound alters protein kinase activity in the 
cell. 

This invention also provides fluorescent protein substrates for a protein 
25 kinase. Fluorescent protein substrates for a protein kinase comprise a fluorescent protein 
moiety and a phosphorylation site for a protein kinase. The protein substrate exhibits a 
different fluorescent property in the phosphorylated state than in the unphosphorylated 
state. In a preferred embodiment, the fluorescent protein is an Aequorea-related 
fluorescent protein. In another embodiment,, the phosphorylation site is located within 
30 about 5, 10, 15 or 20 amino acids of a terminus, e.g., the amino-terminus, of the 

fluorescent protein moiety. In another embodiment, the protein substrate comprises the 
phosphorylation site more than 20 amino acids from a terminal of the fluorescent protein 
moiety and within the fluorescent protein moiety. The phosphorylation site can be one 
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recognized by, for example, protein kinase A, a cGMP-dependent protein kinase, protein 
kinase C, Ca 2+ /caimoduIin-dependent protein kinase I, Cz? + / caimodulin-dependent 
protein kinase II or MAP kinase activated protein kinase type 1 . 

This invention also provides nucleic acid molecules coding for the 
5 expression of a fluorescent protein substrate for a protein kinase of the invention. In one 
aspect, the nucleic acid molecule is a recombinant nucleic acid molecule comprising 
expression control sequences operatively linked to a nucleic acid sequence coding for the 
expression of a fluorescent protein substrate for a protein kinase of the invention. In 
another aspect, the invention provides transfected host cells transfected with a 

10 recombinant nucleic acid molecule comprising expression control sequences operatively 
linked to a nucleic acid sequence coding for the expression of a fluorescent protein 
substrate for a protein kinase of the invention. 

In another aspect, this invention provides collections of fluorescent protein 
candidate substrates comprising at least 10 different members, each member comprising a 

15 fluorescent protein moiety and a variable peptide moiety around the terminus of the 
fluorescent protein moiety. 

In another embodiment, the invention provides collections of recombinant 
nucleic acid molecules comprising at least 10 different recombinant nucleic acid molecule 
members, each member comprising expression control sequences operatively linked to 

20 nucleic acid sequences coding for the expression of a different fluorescent protein 

candidate substrate of the invention. The invention also provides collections of host cells 
comprising at least 10 different host cell members, each member comprising the above 
recombinant nucleic acid molecules. 

The collections of cells are useful in determining the specificity of cellular 

25 kinases, from either diseased or normal tissues. The screening methods involve 

providing a collection of transfected host cells of the invention; culturing the collection of 
host cells under conditions for the expression of the fluorescent protein candidate 
substrate; and determining for each of a plurality of members from the collection 
whether the member contains a fluorescent protein candidate substrate that exhibits a 

30 fluorescent property different than the fluorescent property exhibited by the non- 
phosphorylated candidate substrate. The presence of fluorescent protein candidate 
substrate that exhibits a fluorescent property different than the fluorescent property 
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exhibited by the candidate substrate indicates that the candidate substrate possesses a 
peptide moiety that can be phosphorylated by the kinase present in the host cells. 

This invention also provides kits comprising a fluorescent protein substrate 
and a phosphate donor. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig, 1 is a flow chart showing the steps in an assay method for protein 
kinase activity. 

Fig. 2 depicts molecular events in a cell in altering and detecting 
fluorescent properties of a fluorescent protein substrate for a protein kinase. 

Fig. 3 depicts the nucleotide sequence (SEQ ID NO:l) and deduced amino 
acid sequence (SEQ ID NO:2) of a wild-type Aequorea green fluorescent protein. 

Fig. 4 provides a list of the positions and amino acid changes made for 
phosphorylation mutants made more than fifteen amino acids in the primary sequence 
from the N-terminus, as compared to Fig. 3. Amino acids underlined represent the 
phosphorylation motif, amino acids in brackets represent wild type sequence at those 
positions, 

Fig. 5 depicts plasmid pRSET containing a region encoding GFP that is 
fused in frame with nucleotides encoding an N-terminal polyhistidine tag. 

Figs. 6A-6E show the fluorescence excitation spectra before and after 
phosphorylation of N-terminal phosphorylation mutants by protein kinase A using 
standard phosphorylation conditions. 6A: 1MSRRRRSI (SEQ ID NO:31). 
6B: IMRRRRSII (SEQ ID NO:32). 6C: -1MRRRRSIII (SEQ ID NO:33). 
6D: -2MRRRRSIIIF (SEQ ID NO: 34). 6E: -3MRRRRSIIIIF (SEQ ID NO:35). In all 
cases the spectrum after phosphorylation has higher amplitude than the spectrum before 
phosphorylation. 

Fig. 7 depicts an expression vector having expression control sequences 
operably linked to sequences coding for the expression of protein kinase A catalytic 
subunit (PKA cat) upstream from sequences coding for the expression of a fluorescent 
protein substrate. 

Fig. 8 depicts the fluorescence excitation spectrum of IMRRRRSII (SEQ 
ID NO:33): S65A, N149K, V163A and I167T before and after phosphorylation by 
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protein kinase A using standard phosphorylation conditions. The spectrum after 
phosphorylation has higher amplitude than the spectrum before phosphorylation. 

DETAILED DESCRIPTION OF THE INVENTION 
5 I. METHODS FOR ASSAYING SAMPLES FOR PROTEIN KINASES 

Protein kinases add a phosphate residue to the phosphorylation site of a 
protein, generally through the hydrolysis of ATP to ADP. Fluorescent protein substrates 
for protein kinases are useful in assays to determine the amount of protein kinase activity 
in a sample without the need for radioactivity. The assays of this invention take 

10 advantage of the fact that phosphorylation of the protein substrate results in a change in a 
fluorescent property of the fluorescent protein. Methods for determining whether a 
sample has kinase activity involve contacting the sample with a fluorescent protein 
substrate having a phosphorylation site recognized by the protein kinase to be assayed 
and with a phosphate donor under selected test conditions. A phosphate donor is a 

15 compound containing a phosphate moiety which the kinase is able to use to phosphorylate 
the protein substrate. ATP (adenosine-5 '-triphosphate) is by far the most common 
phosphate donor. In certain instances, the sample will contain enough of a phosphate 
donor to make this step unnecessary. Then the fluorescent protein substrate is excited 
with light in its excitation spectrum. If the protein substrate has been phosphorylated, 

20 the substrate will exhibit different fluorescent properties, indicating that the sample 

contains protein kinase activity. For example, if the phosphorylated form of the protein 
substrate has higher fluorescence than the un-phosphorylated form, the amount of 
fluorescence in the sample will increase as a function of the amount of substrate that has 
been phosphorylated. If the fluorescent property is a change in the wavelength maximum 

25 of emission, the change will be detected as a decrease in fluorescence at the wavelength 
maximum of the un-phosphorylated substrate and an increase in fluorescence at the 
wavelength maximum of the phosphorylated substrate. 

The amount of kinase activity in a sample can be determined by measuring 
the amount of a fluorescent property in the sample at a first time and a second time after 

30 contact between the sample, the fluorescent protein substrate and a phosphate donor, and 
. determining the degree of change or the rate of change in a fluorescent property. For 
example, if phosphorylation results in an increase in fluorescence at the excitation- 
wavelength maximum, the fluorescence of the substrate at this wavelength can be 
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determined at two times. The amount of enzyme activity in the sample can be calculated 
as a function of the difference in the determined amount of the property at the two times. 
For example, the absolute amount of activity can be calibrated using standards of enzyme 
activity determined for certain amounts of enzyme after certain amounts of time. The 
5 faster or larger the difference in the amount, the more enzyme activity must have been 

present in the sample. The amount of a fluorescent property can be determined from any 
spectral or fluorescence lifetime characteristic of the excited substrate, for example, by 
determining the intensity of the fluorescent signal from the protein substrate or the 
excited state lifetime of the protein substrate, the ratio of the fluorescences at two 

10 different excitation wavelengths, the ratio of the intensities at two different emission 
wavelengths, or the excited lifetime of the protein substrate. 

Fluorescence in a sample is measured using a fluorimeter. In general, 
excitation radiation from an excitation source having a first wavelength, passes through 
excitation optics. The excitation optics cause the excitation radiation to excite the 

15 sample. In response, fluorescent proteins in the sample emit radiation which has a 
wavelength that is different from the excitation wavelength. Collection optics then 
collect the emission from the sample. The device can include a temperature controller to 
maintain the sample at a specific temperature while it is being scanned. According to 
one embodiment, a multi-axis translation stage moves a microliter plate holding a 

20 plurality of samples in order to position different wells to be exposed. The multi-axis 

translation stage, temperature controller, auto-focusing feature, and electronics associated 
with imaging and data collection can be managed by an appropriately programmed digital 
computer. The computer also can transform the data collected during the assay into 
another format for presentation. This process can be miniaturized and automated to 

25 enable screening many thousands of compounds. 

Methods of performing assays on fluorescent materials are well known in 
the art and are described in, e.g., Lakowicz, J.R., Principles of Fluorescence 
Spectroscopy, New York:Plenum Press (1983); Herman, B., Resonance energy transfer 
microscopy , in: Fluorescence Microscopy of Living Cells in Culture, Part B, Methods in 

30 Cell Biology, vol. 30, ed. Taylor, D.L. & Wang, Y.-L., San Diego: Academic Press 
(1989), pp. 219-243; Turro, N.J., Modem Molecular Photochemistry, Menlo Park: 
Benjamin/Cummings Publishing Col, Inc. (1978), pp. 296-361. 
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Enzymatic assays also can be performed on isolated living cells in vivo, or 
from samples derived from organisms transfected to express the substrate. Because 
fluorescent protein substrates can be expressed recombinantly inside a cell, the amount of 
enzyme activity in the cell or organism of which it is a part can be determined by 
5 determining a fluorescent property or changes in a fluorescent property of cells or 
samples from the organism. 

In one embodiment, shown in Fig 2, a cell is transiently or stably 
transfected with an expression vector 200 encoding a fluorescent protein substrate 
containing a phosphorylation site for the enzyme to be assayed. This expression vector 

10 optionally includes controlling nucleotide sequences such as promotor or enhancing 
elements. The expression vector expresses the fluorescent protein substrate 210 that 
contains the phosphorylation site 211 for the kinase to be detected. The enzyme to be 
assayed may either be intrinsic to the cell or may be introduced by stable transfection or 
transient co-trans fection with another expression vector encoding the enzyme and 

15 optionally including controlling nucleotide sequences such as promoter or enhancer 

elements. The fluorescent protein substrate and the enzyme preferably are located in the 
same cellular compartment so that they have more opportunity to come into contact. 

If the cell possesses a high degree of enzyme activity (K = "kinase" 220), 
the fluorescent protein substrate will be phosphorylated 230 (P0 4 ), usually through the 

20 hydrolysis of ATP. If the cell does not possess kinase activity, or possesses very little, 
the cell contains substantial amounts of un-phosphorylated substrate 240. Upon 
excitation with light of the appropriate wavelength (hv,) the phosphorylated substrate will 
fluoresce light (hv : ). Un-phosphorylated substrate exhibits different fluorescent 
characteristics upon excitation at the same wavelength, and may, for example, not 

25 fluoresce at all, or fluoresce weakly. The amount of the fluorescent property is 
measured generally using the optics 250 and detector 260 of a fluorimeter. 

If the cell contains phosphatases that compete with the protein kinases, 
removing the phosphate from the protein substrate, the level of enzyme activity in the 
cell can reach an equilibrium between phosphorylated and un-phosphorylated states of the 

30 protein substrate, and the fluorescence characteristics will reflect this equilibrium level. 
In one aspect, this method can be used to compare mutant cells to identify which ones 
possess greater or lesser ratio of kinase to phosphatase activity. Such cells can be sorted 
by a fluorescent ceil sorter based on fluorescence. 
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A contemplated variation of the above assay is to use the controlling 
nucleotide sequences to produce a sudden increase in the expression of either the 
fluorescent protein substrate or the enzyme being assayed, e.g., by inducing expression 
of the construct. A fluorescent property is monitored at one or more time intervals after 
the onset of increased expression. A high amount of the property associated with 
phosphorylated state reflects a large amount or high efficiency of the kinase. This 
kinetic determination has the advantage of minimizing any dependency of the assay on 
the rates of degradation or loss of the fluorescent protein moieties. 

In another embodiment, the vector may be -incorporated into an entire- 
organism by standard transgenic or gene replacement techniques: An expression vector 
capable of expressing the enzyme optionally may be incorporated into the entire organism 
by standard transgenic or gene replacement techniques. Then, a sample from the 
organism containing the protein substrate is tested. For example, cell or tissue 
homogenates, individual cells, or samples of body fluids, such as blood, can be tested. 

II. SCREENING ASSAYS 

The enzymatic assays of the invention can be used in drug screening 
assays to determine whether a compound alters the activity of a protein kinase. In one 
embodiment, the assay is performed on a sample in vitro containing the enzyme. A 
sample containing a known amount of enzyme activity is mixed with a substrate of the 
invention and with a test compound. The amount of the enzyme activity in the sample is 
then determined as above, e.g., by measuring the amount of a fluorescent property at a 
first and second time after contact between the sample, the protein substrate, a phosphate 
substrate, and the compound. Then the amount of activity per mole of enzyme in the 
presence of the test compound is compared with the activity per mole of enzyme in the 
absence of the test compound. A difference indicates that the test compound alters the 

activity of the enzyme. 

In another embodiment, the ability of a compound to alter kinase activity 
in vivo is determined. In an in vivo assay, cells transfected with a expression vector 
encoding a substrate of the invention are exposed to different amounts of the test 
compound, and the effect on fluorescence in each cell can be determined. Typically, the 
difference is calibrated against standard measurements to yield an absolute amount of 
kinase activity. A test compound that inhibits or blocks the activity or expression of the 
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kinase can be detected by a relative increase in the property associated with the un- 
phosphorylated state. The cell can also be transfected with an expression vector to co- 
express the kinase or an upstream signaling component such as a receptor, and 
fluorescent substrate. This method is useful for detecting signaling to a protein kinase of 
interest from an upstream component of the signaling pathway. If a signal from an 
upstream molecule, e.g., a receptor, is inhibited by a drug activity, then the kinase 
activity will not be altered from basal. This provides- a method for screening for 
compounds. which affect cellular events (including receptor-Iigand binding, protein- 
protein interactions or kinase activation) which signal to the target kinase. 

This invention also provides kits containing the fluorescent protein 
substrate and a phosphate substrate for the protein kinase. In one embodiment, the kit 
has a container holding the fluorescent protein substrate and another container holding 
the phosphate substrate. Protein kinases of known activity could be included for use as 
positive controls and standards. 

In - FLUORESCENT PROTEIN SUBSTRATES FOR PROTEIN KINASES 

As used herein, the term "fluorescent property" refers to the molar 
extinction coefficient at an appropriate excitation wavelength, the fluorescence quantum 
efficiency, the shape of the excitation spectrum or emission spectrum, the excitation 
wavelength maximum and emission wavelength maximum, the ratio of excitation 
amplitudes at two different wavelengths, the ratio of emission amplitudes at two different 
wavelengths, the excited state lifetime, or the fluorescence anisotropy. A measurable 
difference in any one of these properties between the. phosphory latcd and un- 
phosphorylated states suffices for the utility of the fluorescent protein substrates of the 
invention in assays for kinase activity. A measurable difference can be determined by 
determining the amount of any quantitative fluorescent property, e.g., the amount of 
fluorescence at a particular wavelength, or the integral of fluorescence over the emission 
spectrum. Optimally, the protein substrates are selected to have fluorescent properties 
that are easily distinguishable in the un-phosphorylated and phosphorylated states. 
Determining ratios of excitation amplitude or emission amplitude at two different 
wavelengths ("excitation amplitude ratioing" and "emission amplitude ratioing", 
respectively) are particularly advantageous because the ratioing process provides an 
internal reference and cancels out variations in the absolute brightness of the excitation 
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source, the sensitivity of the detector, and light scattering or quenching by the sample. 
Furthermore, if phosphorylation of the protein substrate changes its ratio of excitation or 
emission amplitudes at two different wavelengths, then such ratios measure the extent of 
phosphorylation independent of the absolute quantity of the protein substrate. Some of 
the fluorescent protein substrates described herein do exhibit a phosphoiylation-induced 
change in the ratio of excitation amplitudes at two different wavelengths. Even if a 
fluorescent protein substrate does not exhibit a phosphorylation-induced change in 
excitation or emission amplitudes at two wavelengths, cells can be provided that co- 
express another fluorescent protein that is not sensitive to phosphorylation and whose " 
excitation or emission spectrum is peaked at wavelengths distinct from those of the 
phosphorylation substrate. Provided that the expression of the two proteins are both 
controlled by the same nucleotide control sequences, their expression levels should be 
closely linked. Therefore ratioing the excitation or emission amplitude of the 
phosphorylation substrate at its preferred wavelength to the corresponding excitation or 
emission amplitude of the phosphorylation-insensitive reference protein at its separate 
preferred wavelength is an alternative method for canceling out variations in the absolute 
quantity of cells or overall level of protein expression. 

A. Fluorescent Proteins 

As used herein, the term "fluorescent protein" refers to any protein 
capable of fluorescence when excited with appropriate electromagnetic radiation. This 
includes fluorescent proteins whose amino acid sequences are either naturally occurring 
or engineered (i.e., analogs). Many cnidarians use green fluorescent proteins ("GFPs") 
as energy-transfer acceptors in bioluminescence. A "green fluorescent protein/ as used 
herein, is a protein that fluoresces green light. Similarly, "blue fluorescent proteins" 
fluoresce blue light and "red fluorescent proteins" fluoresce red light. GFPs have been 
isolated from the Pacific Northwest jellyfish, Aequorea victoria, the sea pansy, Renilla 
reniformis, and Phialidium gregarium. W.W. Ward et al. f Photochem. Photobiol, 
35:803-808 (1982); L.D. Levine et al. t Comp. Biochem. Physiol., 72B:77-85 (1982). 

A variety of Aequorea-reteted fluorescent proteins having useful excitation 
and emission spectra have been engineered by modifying the amino acid sequence of a 
naturally occurring GFP from Aequorea victoria. (D.C. Prasher et al., Gene, 111:229- 
233 (1992); R. Heim et al., Proc. Natl. Acad. Set., USA, 91:12501-04 (1994); U.S. 
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patent application 08/337,915, filed November 10, 1994; International application 
PCT/US95/14692, filed 11/10/95.) 

As used herein, a fluorescent protein is an "Aequorea-related fluorescent 
protein" if any contiguous sequence of 150 amino acids of the fluorescent protein has at 
least 85% sequence identity with an amino acid sequence, either contiguous or non- 
contiguous, from the 238 amino-acid wild-type Aequorea green fluorescent protein of 
Fig. 3 (SEQ ID NO:2). More preferably, a fluorescent protein is an Aequorea-related 
fluorescent protein if any contiguous sequence of 200 amino acids of the fluorescent 
protein has at least 95% sequence identity with an amino acid sequence, either 
contiguous or non-contiguous, from the wild type Aequorea green fluorescent protein of 
Fig. 3 (SEQ ID NO:2). Similarly, the fluorescent protein may be related to Renilla or 
Phialidium wild-type fluorescent proteins using the same standards. 

Optimal alignment of sequences for aligning a comparison window may be 
conducted by the local homology algorithm of Smith and Waterman (1981) Adv. AppL 
Math., 2:482, by the homology alignment algorithm of Needleman and Wunsch (1970) /. 
Moi BioL t 48:443, by the search for similarity method of Pearson and Lipman (1988) 
Proc. Natl. Acad. ScL, U.S.A., 85:2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, FAST A, and TFASTA in the Wisconsin Genetics Software 
Package Release 7.0, Genetics Computer Group, 575 Science Dr., Madison, WI), or by 
inspection. The best alignment (i.e., resulting in the highest percentage of homology 
over the comparison window, i.e., 150 or 200 amino acids) generated by the various 
methods is selected. 

The percentage of sequence identity is calculated by comparing two 
optimally aligned sequences over the window of comparison, determining the number of 
positions at which the identical amino acid occurs in both sequences to yield the number 
of matched positions, dividing the number of matched positions by the total number of 
positions in the window of comparison (i.e., the window size), and multiplying the result 
by 100 to yield the percentage of sequence identity. 

Aequorea-re\aie6 fluorescent proteins include, for example and without 
limitation, wild-type (native) Aequorea victoria GFP (D.C. Prasher et al., "Primary 
structure of the Aequorea victoria green fluorescent protein," Gene, (1992) 111:229-33), 
whose nucleotide sequence (SEQ ID NO:l) and deduced amino acid sequence (SEQ ID 
NO:2) are presented in Fig. 3, allelic variants of this sequence, e.g., Q80R, which has 



WO 98/02571 PCT7US97/12410 

15 

the glutamine residue at position 80 substituted with arginine (M. Chalfie et al., Science, 
(1994) 263:802-805), those Aequorea-Tt\a\zd engineered versions described in Table I, 
variants that include one or more folding mutations and fragments of these proteins that 
are fluorescent, such as Aequorea green fluorescent protein from which the two amino- 

5 terminal amino acids have been removed. Several of these contain different aromatic 
amino acids within the central chromophore and fluoresce at a distinctly shorter 
wavelength than wild type species. For example, mutants P4 and P4-3 contain (in 
addition to other mutations) the substitution Y66H, whereas W2 and W7 contain (in 
addition to other mutations) Y66W. Other mutations both close to the chromophore 

10 region of the protein and remote from it in primary sequence may affect the spectral 
properties of GFP and are listed in the first part of the table below. 
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Excitation 



Clone 


Mutationfs) 


max (nm) 


Wild 
type 


none 


395 (475) 


P4 


Y66H 


383 


P4-3 


Y66H 
Y145F 


381 


W7 


Y66W 
N146I 
M153T 
VI 63 A 
N212K 


433 (453) 


W2 


Y66W 
1 123V 
Y145H 

M153T 
V163A 
N212K 


432 (453) 


S65T 


S65T 


489 


P4-1 


S65T 

M153A 

K238E 


504 (396) 


S65A 


S65A 


471 


S65C 


S65C 


479 


S65L 


S65L 


484 


Y66F 


Y66F 


360 


Y66W 


Y66W 


458 



Emission Extinct. Coeff. Quantum 

max (nnv> (M 'cm' 1 ) yield 

508 21,000 (7,150) 0.77 

447 13,500 0.21 

445 14,000 0.38 

475 (501) 18,000 0.67 
(17,100) 

480 10,000 (9,600) 0.72 

511 39,200 0.68 

514 14,500 (8,600) 0.53 



504 
507 
510 
442 
480 



Additional mutations in Aequorea-velated fluorescent proteins, referred to 
as "folding mutations," improve the ability of GFP to fold at higher temperatures, and to 
be more fluorescent when expressed in mammalian cells, but have little or no effect on 
the peak wavelengths of excitation and emission. It should be noted that these may be 
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combined with mutations that influence the spectral properties of GFP to produce 
proteins with altered spectral and folding properties. Folding mutations include: T44A, 
F64L, V68L, S72A, F99S, Y145F, N146I, M153T or A, V163A, I167T, S175G, S205T 
and N212K. 

This invention contemplates the use of other fluorescent proteins in 
fluorescent protein substrates for protein kinases. The cloning and expression of yellow . 
fluorescent protein from Vibrio fischeri strain Y-l has been described by T.O. Baldwin et 
al., Biochemistry (1990) 29:5509-15. This protein requires flavins as fluorescent co- 
factors. The cloning of Peridinin-chlorophyll a binding protein from the dinoflagellate 
Symbiodinium sp. was described by B.J. Morris et al., Plant Molecular Biology, (1994) 
24:673:77. One useful aspect of this protein is that it fluoresces red. The cloning of 
phycobiliproteins from marine cyanobacteria such as Synechococcus , e.g., phycoerythrin 
and phycocyanin, is described in S.M. Wilbanks et al., J. Biol Chem. (1993) 268:1226- 
35. These proteins require phycobilins as fluorescent co-factors, whose insertion into the 
proteins involves auxiliary enzymes. The proteins fluoresce at yellow to red 
wavelengths. 

As used herein, the "fluorescent protein moiety" of a fluorescent protein 
substrate is that portion of the amino acid sequence of a fluorescent protein substrate 
which, when the amino acid sequence of the fluorescent protein substrate is optimally 
aligned with the amino acid sequence of a naturally occurring fluorescent protein, lies 
between the amino terminal and carboxy terminal amino acids, inclusive, of the amino 
acid sequence of the naturally occurring fluorescent protein. 

It has been found that fluorescent proteins can be genetically fused to other 
target proteins and used as markers to identify the location and amount of the target 
protein produced. Accordingly, this invention provides fusion proteins comprising a 
fluorescent protein moiety and additional amino acid sequences. Such sequences can be, 
for example, up to about 15, up to about 50, up to about 150 or up to about 1000 amino 
acids long. The fusion proteins possess the ability to fluoresce when excited by 
electromagnetic radiation. In one embodiment, the fusion protein comprises a 
polyhistidine tag to aid in purification of the protein. 
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B. Phosphorylation Sites For Protein Kinases 

Fluorescent protein substrates for a protein kinase are the subset of 
fluorescent proteins as defined above whose amino acid sequence includes a 
phosphorylation site. Fluorescent protein substrates can be made by modifying the amino 
5 acid sequence of an existing fluorescent protein to include a phosphorylation site for a 
protein kinase. Fluorescent protein substrates for protein kinases are not meant to 
include isolated fluorescent proteins that have a naturally occurring phosphorylation site, 
naturally occurring fluorescent proteins or currently known mutant fluorescent proteins. 
Such previously known fluorescent proteins or mutants may be substrates for protein 
10 kinases, but do not exhibit any detectable change in fluorescent properties upon 
phosphorylation. 

As used herein, the term "phosphorylation site for a protein kinase" refers 
to an amino acid sequence which, as part of a polypeptide, is recognized by a protein 
kinase for the attachment of a phosphate moiety. The phosphorylation site can be a site 

15 recognized by, for example, protein kinase A, a cGMP-dependent protein kinase, protein 
kinase C, Ca 2 "7calmodulin-dependent protein kinase I, Ca 2 + / calmodul in-dependent 
protein kinase II or MAP kinase activated protein kinase type 1. 

The preferred consensus sequence for protein kinase A is RRXSZ (SEQ 
ID NO:3) or RRXTZ (SEQ ID NO:4), wherein X is any amino acid and Z is a 

20 hydrophobic amino acid, preferably valine, leucine or isoleucine. Many variations in the 
above sequence are allowed, but generally exhibit poorer kinetics. For example, lysine 
(K) can be substituted for the second arginine. Many consensus sequences for other 
protein kinases have been tabulated, e.g. by Kemp, B.E. and Pearson, R.B. (1990) 
Trends Biochem. Sci. 15: 342-346; Songyang, Z. et al. (1994) Current Biology 4: 973- 

25 982. 

For example, a fluorescent protein substrate selective for phosphorylation 
by cGMP-dependent protein kinase can include the following consensus sequence: 
BKISASEFDR PLR (SEQ ID NO: 5), where B represents either lysine (K) or arginine 
(R), and the first S is the site of phosphorylation (Colbran et al. (1992) J. Biol. Chem. 
30 267: 9589-9594). The residues DRPLR (SEQ ID NO:6) are less critical than the 
phenylalanine (F) just preceding them for specific recognition by cGMP-dependent 
protein kinase in preference to cAMP-dependent protein kinase. 
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Either synthetic or naturally occurring motifs can be used to create a 
protein kinase phosphorylation site. For example, peptides including the motif 
XRXXSXRX (SEQ ID NO:7), wherein X is any amino acid, are among the best 
synthetic substrates (Kemp and Pearson, supra) for protein kinase C. Alternatively, the 
5 Myristoylated Alanine-Rich Kinase C substrate ("MARCKS") is one of the best 

substrates for PKC and is a real target for the kinase in vivo. The sequence around the 
phosphorylation site of MARCKS is KKKKRFSFK (SEQ ID NO:8) (Graff et al. (1991) 
/. Biol. Chem. 266:14390-14398). Either of these two sequences can be incorporated 
into a fluorescent protein to make it a substrate for protein kinase C. 

!0 a protein substrate for Ca 2+ /calmodulin-dependent protein kinase I is 

derived from the sequence of synapsin I, a known optimal substrate for this kinase. The 
recognition sequence around the phosphorylation site is LRRLSDSNF (SEQ ID NO:9) 
(Lee et al. (1994) Proc. Natl. Acad, Sci. USA 91:6413-6417). 

A protein substrate selective for Ca 2 4 /calmodul in-dependent protein kinase 

15 II is derived from the sequence of glycogen synthase, a known optimal substrate for this 
kinase. The recognition sequence around the phosphorylation site is KKLNRTLTVA 
(SEQ ID NO: 10) (Stokoe et al. (1993) Biochem. J. 296:843-849). A small change in 
this sequence to KKA NRTLS V A (SEQ ID NO: 11) makes the latter specific for MAP 
kinase activated protein kinase type 1 . 

20 In one embodiment, the fluorescent protein substrate contains a 

phosphorylation site around one of the termini, in particular, the amino-terminus, of the 
fluorescent protein moiety. The site preferably is located in a position within five, ten, 
fifteen, or twenty amino acids of a position corresponding to the wild type amino- 
terminal amino acid of the fluorescent protein moiety ("within twenty amino acids of the 

25 amino-terminus"). This includes sites engineered into the existing amino acid sequence 
of the fluorescent protein moiety and sites produces by extending the amino terminus of 
the fluorescent protein moiety. 

One may, for example, modify the existing sequence of wild type 
Aequorea GFP or a variant or it as listed above to include a phosphorylation site within 

30 the first ten or twenty amino acids. In one embodiment, the naturally occurring sequence 
is modified as follows: 
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wild type: MSKGEELFTG (1-10 of SEQ ID NO:2) 

substrate: MRRRRSIITG (SEQ ID NO: 12). 

One may include modifying the naturally occurring sequence of Aequorea 
GFP by introducing a phosphorylation site into an extended amino acid sequence of such 
a protein created by adding flanking sequences to the amino terminus, for example: 

wild type: MSKGEELFTG (1-10 of SEQ ID NO:2) 

substrate: MRRRRSIIIIFTG (SEQ ID NO: 13). 

Fluorescent protein substrates having a phosphorylation site around a ' 
terminus of the fluorescent protein moiety offer the following advantages. First, it is 
often desirable to append additional amino acid residues onto the fluorescent protein , 
moiety in order to create a specific phosphorylation consensus sequence. Such a 
sequence is much less likely to disrupt the folding pattern of the fluorescent protein when 
appended onto the terminus than when inserted into the interior of the protein sequence. 
Second, different phosphorylation motifs can be interchanged without significant 
disruption of GFP therefore providing a general method of measuring different kinases. 
Third, the phosphorylation site is exposed to the surface of the protein and, therefore, 
more accessible to protein kinases. Fourth, we have discovered that phosphorylation at 
sites close to the N-terminus of GFP can provide large changes in fluorescent properties 
if the site of phosphorylation is chosen such that the Ser or Thr residue which is 
phosphorylated occupies a position which in the wild-type protein was originally 
negatively or positively charged. Specifically, replacement of Glu 5 or Glu 6 by a non- 
charged Ser or Thr residue can significantly disrupt fluorescence of GFP when made 
within the right context of surrounding amino acids. Phosphorylation of the serine or 
threonine will restore negative charge to this position and thereby increases fluorescence. 

In another embodiment, the fluorescent protein substrate includes a 
phosphorylation site remote from the terminus, e.g., that is separated by more than about 
twenty amino acids from the terminus of the fluorescent protein moiety and within the 
fluorescent protein moiety. One embodiment of this form includes the Aequorea-relaXtd 
fluorescent protein substrate comprising the substitution H217S, creating a consensus 
protein kinase A phosphorylation site. Additionally, phosphorylation sites comprising the 
following alterations based on the sequence of wild type Aequorea GFP exhibit 
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wild type: MSKGEELFTG (1-10 of SEQ ID NO: 2) 
substrate: MRRRRSIITG (SEQ ID NO: 12). 

. One may include modifying the naturally occurring sequence of Aequorea 
GFP by introducing a phosphorylation site into an extended amino acid sequence of such 
5 a protein created by adding flanking sequences to the amino terminus, for example: 
wild type: MSKGEELFTG (1-10 of SEQ ID NO:2) 
substrate: MRRRRSIIIIFTG (SEQ ID NO: 13). 

Fluorescent protein substrates having a phosphorylation site around a " 
terminus of the fluorescent protein moiety offer the following advantages. First, it is 
10 often desirable to append additional amino acid residues onto the fluorescent protein 
moiety, in order to create a specific phosphorylation consensus sequence. Such a 
sequence is much less likely to disrupt the folding pattern of the fluorescent protein when 
appended onto the terminus than when inserted into the interior of the protein sequence. 
Second, different phosphorylation motifs can be interchanged without significant 
15 disruption of GFP therefore providing a general method of measuring different kinases. 
Third, the phosphorylation site is exposed to the surface of the protein and, therefore, 
more accessible to protein kinases. Fourth, we have discovered that phosphorylation at 
sites close to the N-terminus of GFP can provide large changes in fluorescent properties 
if the site of phosphorylation is chosen such that the Ser or Thr residue which is 
20 phosphorylated occupies a position which in the wild-type protein was originally 

negatively or positively charged. Specifically, replacement of Glu 5 or Glu 6 by a non- 
charged Ser or Thr residue can significantly disrupt fluorescence of GFP when made 
within the right context of surrounding amino acids. Phosphorylation of the serine or 
threonine will restore negative charge to this position and thereby increases fluorescence. 
25 In another embodiment, the fluorescent protein substrate includes a 

phosphorylation site remote from the terminus, e.g., that is separated by more than about 
twenty amino acids from the terminus of the fluorescent protein moiety and within the 
fluorescent protein moiety. One embodiment of this form includes the Aequorea-related 
fluorescent protein substrate comprising the substitution H217S, creating a consensus 
30 protein kinase A phosphorylation site. Additionally, phosphorylation sites comprising the 
following alterations based on the sequence of wild type Aequorea GFP exhibit 
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fluorescent changes upon phosphorylation: 69RRFSA (SEQ ID NO:14) and 214KRDSM 
(SEQ ID NO: 15). 

The practitioner should consider the following in selecting amino acids for 
substitution within the fluorescent protein moiety remote in primary amino acid sequence 
5 from the terminus. First, it is preferable to select amino acid sequences within the 

fluorescent protein moiety that resemble the sequence of the phosphorylation site. In this 
way, fewer amino acid substitutions in the native protein are needed to introduce the 
phosphorylation site into the fluorescent protein. For example, protein kinase A 
recognizes the sequence RRXSZ (SEQ ID NO:3) or RRXTZ (SEQ ID NO:4), wherein X 

10 is any amino acid and Z is a hydrophobic amino acid. Serine or threonine is the site of 
phosphorylation. It is preferable to introduce this sequence into the fluorescent protein 
moiety at sequences . already containing Ser or Thr, so that Ser or Thr are not substituted 
in the protein. More preferably the phosphorylation site is created at locations having 
some existing homology to the sequence recognized by protein kinase A, e.g., having a 

15 proximal Arg or hydrophobic residues with the same spatial relationship as in the 
phosphorylation site. 

Second, locations on the surface of the fluorescent protein are preferred 
for phosphorylation sites. This is because surface locations are more likely to be 
accessible to protein "kinases than interior locations. Surface locations can be identified 

20 by computer modeling of the fluorescent protein structure or by reference to the crystal 
structure of Aequorea GFP. Also, charged amino acids in the fluorescent protein are 
more likely to lie on the surface than inside the fluorescent protein, because such amino 
acids are more likely to be exposed to water in the environment. 

In cases where the phosphorylation site is either at the N-terminus or 

25 remote from it, the amino acid context around the phosphorylation site needs to be 

optimized in order to maximize the change in fluorescence. Amino acid substitutions 
that change large bulky and or hydrophobic amino acids to smaller and less hydrophobic 
replacements are generally helpful. Similarly large charged amino acids can be replaced 
by smaller, less charged amino acids. For example: 



30 
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a/Hydrophobic to less hydrophobic 

Phe to Leu 

Leu to Ala 
b/Charged to charged but smaller 
5 . Glu to Asp 

Arg to Lys 
c/Charged to less charged 

Glu to Gin 

Asp to Asn 

10 d/Charged to polar 

Glu to Thr 
Asp to Ser 
e/Charged to non-polar 
Glu to Leu 

15 Asp to Ala 

These changes can be accomplished by directed means or using random 
iterative approaches where changes are made randomly and the best ones selected based 
upon their change in fluorescent properties after phosphorylation by an appropriate 
20 kinase. 

Third, amino acids at distant locations from the actual site of 
phosphorylation can be varied to enhance fluorescence changes upon phosphorylation. 
These mutations can be created through site directed mutagenesis, or through random 
mutagenesis, for example by error-prone PCR, to identify mutations that enhance either 

25 absolute fluorescence or the change in fluorescence upon phosphorylation. The 

identification of mutants remote in primary sequence from the N-terminus identifies 
potentially interacting sequences which may provide additional areas in which further 
mutagenesis could be used to refine the change in fluorescence upon phosphorylation. 
For example, it has been determined that mutations around the amino terminus 

30 phosphorylation site interact (either transiently during folding, or in a stable fashion) with 
amino acids at positions 171 and 172, and that point mutations that significanily disrupt 
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fluorescence of GFP by changing negative to positive charges near the amino terminus 
, can be rescued by changing a positive to a negative charge at position 171. 

In the phosphorylation mutant 50 the sequence is a/ and for reference the 
wild type sequence b/ is listed below. 

5 

a/ MSKRRDSLT (SEQ ID NO: 16) 

b/ MSKGEELFT (1-9 of SEQ ID NO:2) 

The phosphorylation mutant has only 7 % of the fluorescence of wild type 
10 protein. However, its fluorescence can be restored to 80% of wild type by 2 amino acid 
changes, E171K and I172V, positions which are quite remote in linear sequence from the 
amino terminus. 

Thus, changes in charge at E171K (negative to positive) can almost 
completely restore the fluorescence of the phosphorylation mutant, strongly suggesting 

15 that the original loss of fluorescence arose primarily through changes in charge caused by 
the point mutations. It is clear that the addition and loss of charge at positions around, 
and at the phosphorylation site, have a significant impact on fluorescence formation. The 
fact that charge alone can significantly affect the fluorescence properties of GFP is highly 
significant within the scope of the present application since phosphorylation involves the 

20 addition of 2 negative charges associated with the phosphate group (OP0 3 ' 2 ) on the serine 
residue. 

In the above case the mutations restore fluorescence of the phosphorylation 
mutant, without significantly increasing the magnitude of the change in fluorescence upon 
phosphorylation. Nevertheless the identification of these positions in GFP provides a 

25 valuable tool to further optimize changes in fluorescence upon phosphorylation by 
creating random mutations at codons around positions 171, 172 and 173 to identify 
mutations that enhance changes in fluorescence upon phosphorylation. 

This can be achieved by co-expressing the kinase of interest with the , 
fluorescent substrate of the invention containing random mutations which may enhance 

30 the fluorescence changes upon phosphorylation in bacteria (in the example above these 
would be NNK mutations at codons 171, 172 and 173, where N represents a random 
choice of any of the four bases and K represents a random choice of guanine or 
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thymine). The expression vector containing the mutated fluorescent substrates and the 
kinase are transformed into host bacteria and the individual bacterial colonies grown up. 
Each colony is derived from a single cell, and hence contains a single unique mutant 
fluorescent substrate grown up. 
5 The individual colonies may then be grown up and screened for 

fluorescence either by fluorescence activated cell sorting (FACS), or by observation 
under a microscope. Those that exhibit the greatest fluorescence can then be rescreened 
under conditions in which the kinase gene is inactivated. This can be achieved by 
appropriate digests of the kinase gene by restriction enzymes that specifically cut within 
10 the kinase but not GFP. Comparison of the brightness of the mutant first in the presence 
of kinase then in its absence indicates the relative effect of phosphorylation on the mutant 
GFP. 

C. Production Of Fluorescent Protein Substrates For Protein Kinases 
15 While certain fluorescent protein substrates for protein kinases can be 

prepared chemically, for example, by coupling a peptide moiety to the amino terminus of 
a fluorescent protein, it is preferable produce fluorescent protein substrates 
recombinantly. 

Recombinant production of a fluorescent protein substrate involves 
20 expressing a nucleic acid molecule having sequences that encode the protein. As used 

herein, the term "nucleic acid molecule" includes both DNA and RNA molecules. It will 
be understood that when a nucleic acid molecule is said to have a DNA sequence, this 
also includes RNA molecules having the corresponding RNA sequence in which "U" 
replaces "T." The term "recombinant nucleic acid molecule" refers to a nucleic acid 
25 molecule which is not naturally occurring, and which comprises two nucleotide sequences 
which are not naturally joined together. Recombinant nucleic acid molecules are 
produced by artificial combination, e.g., genetic engineering techniques or chemical 
synthesis. 

In one embodiment, the nucleic acid encodes a fusion protein in which a 
30 single polypeptide includes the fluorescent protein moiety within a longer polypeptide. 
In another embodiment the nucleic acid encodes the amino acid sequence of consisting 
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essentially of a fluorescent protein modified to include a phosphorylation site. In either 
case, nucleic acids that encode fluorescent proteins are useful as starting materials. 

Nucleic acids encoding fluorescent proteins can be obtained by methods 
known in the art. For example, a nucleic acid encoding a green fluorescent protein can 
5 be isolated by polymerase chain reaction of cDNA from A. victoria using primers based ' 
on the DNA sequence of A. victoria green fluorescent protein, as presented in Fig. 3. 
PCR methods are described in, for example, U.S. Pat. No. 4,683,195; Mullis et al. 
(1987) Cold Spring Harbor Symp. Quant. Biol. 51:263; and Erlich, ed., PCJR 
Technology, (Stockton Press, NY, 1989). 

10 Mutant versions of fluorescent proteins can be made by site-specific 

mutagenesis of other nucleic acids encoding fluorescent proteins, or by random 
mutagenesis caused by increasing the error rate of PCR of the original polynucleotide 
with 0.1 mM MnCl 2 and unbalanced nucleotide concentrations. See, e.g., U.S. patent 
application 08/337,915, filed November 10, 1994 or International application 

15 PCT/US95/ 14692, filed 11/10/95. 

Nucleic acids encoding fluorescent protein substrates which are fusions 
between a polypeptide including a phosphorylation site and a fluorescent protein and can 
be made by ligating nucleic acids that encode each of these. Nucleic acids encoding 
fluorescent protein substrates which include the amino acid sequence of a fluorescent 

20 protein in which one or more amino acids in the amino acid sequence of a fluorescent 
protein are substituted to create a phosphorylation site can be created by, for example, 
site specific mutagenesis of a nucleic acid encoding a fluorescent protein. 

The construction of expression vectors and the expression of genes in 
transfected cells involves the use of molecular cloning techniques also well known in the 

25 art. Sambrook et al.. Molecular Cloning -- A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, (1989) and Current Protocols in Molecular 
Biology, F.M. Ausubel et al., eds., (Current Protocols, a joint venture between Greene 
Publishing Associates, Inc. and John Wiley & Sons, Inc. 

Nucleic acids used to transfect cells with sequences coding for expression 

30 of the polypeptide of interest generally will be in the form of an expression vector 

including expression control sequences operatively linked to a nucleotide sequence coding 
for expression of the polypeptide. As used, the term nucleotide sequence "coding for 



WO 98/02571 PCTYUS97/12410 

26 

expression of" a polypeptide refers to a sequence that, upon transcription and translation 
of mRNA, produces the polypeptide. As any person skilled in the art recognizes, this 
includes all degenerate nucleic acid sequences encoding the same amino acid sequence. 
This can include sequences containing, e.g., introns. As used herein, the term 
"expression control sequences" refers to nucleic acid sequences that regulate the 
expression of a nucleic acid sequence to which it is operatively linked. Expression 
control sequences are "operatively linked" to a nucleic acid sequence when the 
expression control sequences control and regulate the transcription and, as appropriate, 
translation of the nucleic acid sequence. Thus, expression control sequences can include 
appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in 
front of a protein-encoding gene, splicing signals for introns, maintenance of the correct 
reading frame of that gene to permit proper translation of the mRNA, and stop codons. 

The recombinant nucleic acid can be incorporated into an expression 
vector comprising expression control sequences operatively linked to the recombinant 
nucleic acid. The expression vector can be adapted for function in prokaryotes or 
eukaryotes by inclusion of appropriate promoters, replication sequences, markers, etc. 

The expression vector can be transfected into a host cell for expression of 
the recombinant nucleic acid. Host cells can be selected for high levels of expression in 
order to purify the protein. E. coli is useful for this purpose. Alternatively, the host 
cell can be a prokaryotic or eukaryotic cell selected to study the activity of an enzyme 
produced by the cell. The cell can be, e.g., a cultured cell or a cell in vivo. 

Recombinant fluorescent protein substrates can be produced by expression 
of nucleic acid encoding for the protein in E. coli. Aequorea-rclated fluorescent proteins 
are best expressed by cells cultured between about 15° C and 30° C but higher 
temperatures (e.g. 37° C) are possible. After synthesis, these enzymes are stable at 
higher temperatures (e.g., 37° C) and can be used in assays at those temperatures. 

The construct can also contain a tag to simplify isolation of the substrate. 
For example, a polyhistidine tag of, e.g., six histidine residues, can be incorporated at 
the amino or carboxyl terminal of the fluorescent protein substrate. The polyhistidine tag 
allows convenient isolation of the protein in a single step by nickel-chelate 
chromatography. 
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Alternatively, the substrates need not be isolated from the host cells. This 
method is particularly advantageous for the assaying for the presence of protein kinase 
activity in situ. 



5 IV. LIBRARIES OF CANDIDATE SUBSTRATES 

The inclusion of a phosphorylation site around the amino terminus of a 
fluorescent protein moiety can provide a fluorescent protein that, when phosphorylated, 
can alter a fluorescent property of the protein. Accordingly, this invention provides 
libraries of fluorescent protein candidate substrates useful for screening in the 
10 identification and characterization of sequences that can be recognized and efficiently 
phosphorylated by a kinase. Libraries of these proteins can be screened to identify 
sequences, that can be phosphorylated by kinases of unknown substrate specificity, or to 
characterize differences in kinase activity in, or from, diseased and normal cells or 
tissues. 

15 As used herein, a "library" refers to a collection containing at least 10 

different members. Each member of a fluorescent protein candidate substrate library 
comprises a fluorescent protein moiety and a variable peptide moiety, which is preferably 
located near the amino-terminus of the fluorescent protein moiety and preferably has 
fewer than about 15 amino acids. The variety of amino acid sequences for the peptide 

20 moiety is at the discretion of the practitioner. For example, the library can contain a 
quite diverse collection of variable peptide moieties in which most or all of the amino 
acid positions are subjected to a non-zero but low probability of substitution. Also, the 
library can contain variable peptide moieties having an amino acid sequence in which 
only a few, e.g., one to ten, amino acid positions are varied, but the probability of 

25 substitution at each position is relatively high. 

Preferably, libraries of fluorescent protein candidate substrates are created 
by expressing protein from libraries of recombinant nucleic acid molecules having 
expression control sequences operatively linked to nucleic acid sequences that code for 
the expression of different fluorescent protein candidate substrates. Methods of making 

30 nucleic acid molecules encoding a diverse collection of peptides are described in, for 
example, U.S. patent 5,432,018 (Dower et al.), U.S. patent 5,223,409 (Ladner et al.), 
U.S. Patent 5,264,563 (Huse), and Internationa] patent publication WO 92/06176 (Huse 
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et al.). For expression of fluorescent protein candidate substrates, recombinant nucleic 
acid molecules are used to transfect cells, such that each cell contains a member of the 
library. This produces, in turn, a library of host cells capable of expressing the library 
of different fluorescent protein candidate substrates. The library of host cells is useful in 
5 the screening methods of this invention. 

In one method of creating such a library, a diverse collection of 
oligonucleotides having preferably random codon sequences are combined to create 
polynucleotides encoding peptides having a desired number of amino acids. The 
oligonucleotides preferably are prepared by chemical synthesis. The polynucleotides 

10 encoding variable peptide moiety can then be coupled to the 5' end of a nucleic acid 

coding for the expression of a fluorescent protein moiety or a carboxy-terminal portion of 
it. That is, the fluorescent protein moiety can be cut back to eliminate up to 20 amino 
acids of the reference fluorescent protein. This creates a recombinant nucleic acid 
molecule coding for the expression of a fluorescent protein candidate substrate having a 

15 peptide moiety fused to the amino terminus of the fluorescent protein. This recombinant 
nucleic acid molecule is then inserted into an expression vector to create a recombinant 
nucleic acid molecule comprising expression control sequences operatively linked to the 
sequences encoding the candidate substrate. 

To generate the collection of oligonucleotides which forms a series of 

20 codons encoding a random collection of amino acids and which is ultimately cloned into 
the vector, a codon motif is used, such as (NNK)„ where N may be A, C, G, or T 
(nominally equimolar), K is G or T (nominally equimolar), and x is the desired number 
of amino acids in the peptide moiety, e.g., 15 to produce a library of 15-mer peptides. 
The third position may also be G or C, designated "S", Thus, NNK or NNS (i) code for 

25 all the amino acids, (ii) code for only one stop codon, and (iii) reduce the range of codon 
bias from 6:1 to 3:1. The expression of peptides from randomly generated mixtures of 
oligonucleotides in appropriate recombinant vectors is discussed in Oliphant et al., Gene 
44:177-183 (1986). 

An exemplified codon motif (NNK) 6 (SEQ ID NO: 17) produces 32 

30 codons, one for each of 12 amino acids, two for each of five amino acids, three for each 
of three amino acids and one (amber) stop codon. Although this motif produces a codon 
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distribution as equitable as available with standard methods of oligonucleotide synthesis, 
it results in a bias against peptides containing one-codon residues. 

An alternative approach to minimize the bias against one-codon residues 
involves the synthesis of 20 activated tn-nucleotides, each representing the codon for one 
5 of the 20 genetically encoded amino acids. These are synthesized by conventional 

means, removed from the support but maintaining the base and 5-HO-protecting groups, 
and activating by the addition of 3'O-phosphoramidite (and phosphate protection with 
beta-cyanoethyl groups) by the method used for the activation of mononucleosides, as 
generally described in McBride and Caruthers, Tetrahedron Letters 22:245 (1983). 

10 Degenerate "oligocodons" are prepared using these trimers as building blocks. The 

trimers are mixed at the desired molar ratios and installed in the synthesizer. The ratios 
will usually be approximately equimolar, but may be a controlled unequal ratio to obtain 
the over- to under-representation of certain amino acids coded for by the degenerate 
oligonucleotide collection. The condensation of the trimers to form the oligocodons is 

15 done essentially as described for conventional synthesis employing activated 

mononucleosides as building blocks. See generally, Atkinson and Smith, Oligonucleotide 
Synthesis, M.J. Gait, ed. p35-82 (1984). Thus, this procedure generates a population of 
oligonucleotides for cloning that is capable of encoding an equal distribution (or a 
controlled unequal distribution) of the possible peptide sequences. 

20 Libraries of amino terminal phosphorylation sites may also be annealed to 

libraries of randomly mutated GFP sequences to enable the selection of optimally 
responding substrates. 

V. METHODS FOR SCREENING LIBRARIES OF CAND IDATE SUBSTRATES 
25 Libraries of host cells expressing fluorescent protein candidate substrates 

are useful in identifying fluorescent proteins having peptide moieties that alter a 
fluorescent property of the fluorescent protein. Several methods of using the libraries 
are envisioned. In general, one begins with a library of recombinant host cells, each of 
which expresses a different fluorescent protein candidate substrate. Each cell is 
30 expanded into a clonal population that is genetically homogeneous. 

In a first method, the desired fluorescent property is measured from each 
clonal population before and at least one specified time after a known change in 
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intracellular protein kinase activity. This change in kinase activity could be produced by 
transfection with a gene encoding the kinase, by induction of kinase gene expression 
using expression control elements, or by any condition that post-translationally modulates 
activity of a kinase that has already been expressed. Examples of the latter include cell 
surface receptor mediated elevation of intracellular cAMP to activate cAMP-dependent 
surface receptor mediated increases of intracellular cGMP to activate cGMP-dependent 
protein kinase, cytosolic free calcium to activate Ca 2+ /calmodul in-dependent protein 
kinase types I, II, or IV, or the production of diacylglycerol to activate protein kinase C, 
etc. One then selects for the clone(s) that show the biggest or fastest change in the 
desired fluorescence property. This method detects fluorescent protein mutants whose 
folding and maturation was influenced by phosphorylation as well as those affected by 
phosphorylation after maturation. 

One embodiment of this method exploits the fact that the catalytic subunit 
of cAMP-dependent protein kinase is constitutively active in the absence of the 
regulatory subunit and is growth-inhibitory in E. coli and most mammalian cells. 
Therefore, the cells tend to shed the kinase gene by recombination. The change in 
kinase activity is obtained by culturing the cells for a time sufficienMo lose the kinase 
gene. 

In a second method the host cells do not express the protein kinase of 
interest. Each clonal population is separately lysed. ATP is then added to the lysate. 
After an incubation period to allow phosphorylation by background kinases, the 
fluorescence property is measured. Then exogenous protein kinase is added to the lysate 
and the fluorescent property is re-measured at one or more specified time points. Again 
one selects for the clone(s) that show the biggest or fastest change(s) in the desired 
fluorescence property. Because little or no fresh protein synthesis is likely to occur in 
the lysate, this method would discriminate against mutants which are sensitive to 
phosphorylation only during their folding and maturation. 

In one embodiment of this method, the lysate is split into two aliquots, one 
of which is mixed with kinase and ATP, the other of which receives only ATP. One 
selects for the clone(s) that show the biggest difference in fluorescence property between 
the two aliquots. 
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The nucleic acids from cells exhibiting the different properties can be 
isolated from the cells. Candidate substrates having different fluorescent properties can 
be tested further to identify the source of the difference. 

The host cell also can be transfected with an expression vector capable of 
expressing an enzyme, such as a protein kinase, whose effect on the fluorescent property 
is to be tested. 

VI. EXAMPLES 

A. Phosphorylation sites located in the amino acid sequence of Aequorea GFP 
remote in the primary amino acid sequence fro m the N-terminus 
Potential sites for phosphorylation were chosen at or close to positions in 
GFP which had previously been identified to exert significant effects on fluorescence, or 
which had a higher probability of surface exposure based on computer algorithms (Fig. 
4). For example, in a mutant called H9, Ser202 and Thr203 are mutated to F and I 
respectively, creating a large change in spectral properties (see also Ehrig el al t 1995). 
Therefore in one mutant, 199RRLSI (SEQ ID NO: 18), a potential site of phosphorylation 
was created around Ser202, whose phosphorylation should significantly affect the 
fluorescent properties. Similarly the amino acids located at positions 72 and 175 have 
been implicated in increased folding efficiency of GFP at higher temperatures and were 
made into potential sites of phosphorylation in separate mutants. 

A complete list of the positions and amino acid changes made for each 
phosphorylation mutant in this series is outlined in Fig. 4. GFP was expressed in E. ccli 
using the expression plasmid pRSET (Invitrogen), in which the-region encoding GFP was 
fused in frame with nucleotides encoding an N-terminal polyhistidine tag (Fig. 5). The 
sequence changes were introduced by site-directed mutagenesis using the Bio-Rad 
mutagenesis kit (Kunkel, T.A. (1985) Proc. Natl. Acad. Sci. 82:488-492, Kunkel T T. A. , 
Roberts, J.D., and Zakour, R.A. (1987) Meth Enzymol 154:367-382) and confirmed by 
sequencing. The recombinant proteins were induced with IPTG and expressed in 
bacteria and purified by nickel affinity chromatography. The sequence changes, relative 
fluorescence, relative rate of phosphorylation and %. change in fluorescence upon 
phosphorylation are listed in Table II. In those cases where the protein exhibited no 
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fluorescence after insertion of the phosphorylation site no determinations were made on 
the effect of phosphorylation on fluorescence. 

Table II: Relative fluorescence, rate of phosphorylation and change in fluorescence upon 
phosphorylation for mutants incorporating phosphorylation sites remote from 
the N-terminus 

SEQ ID Sequence Fluorescence before . Relative rates of % Change in fluorescence 

HO: phosphorylation phosphorylation after incubation with kinase 

(% of wild type) 



19 


25RRFSV 


95 


1.75 


-5 


20 


68RRFSR 


0 


n.d 


n.d 


14 


68RRFSA 


6 


0.6 


+ 10 


21 


94RRSIF 


0 


n.d 


n.d 


22 


131RRGSIL 


0 


n.d 


n.d 


23 


1 55KRKSGI 


86 


2.5 


0 


24 


172RRGSV 


90 


1.57 


0 


IS 


199RRLSI 


0 


n.d 


n.d 


15 


214KRDSM 


21 


1.88 


+40 



Bold letters indicate site of phosphorylation. Numbers prior to the 
sequence indicate amino acid position in wild type GFP (Fig. 3, SEQ ID NO:2) where 
phosphorylation site starts. The relative rates of phosphorylation compare the rate of 
phosphorylation of the given phosphorylation site with the endogenous protein kinase A 
phosphorylation site in Aequorea GFP (HKFSV SEQ ID NO:l) measured by 
incorporation of 32 P after incubation of the purified substrate and protein kinase A 
catalytic subunit in the presence of 32 P-labelled ATP using 3/*g GFP, 5^g protein kinase 
A catalytic subunit for 10 minutes at 30°C in standard phosphorylation buffer ( 20 mM 
MOPS pH 6.5, lOOmM KC1, 100/iM ATP, 3mM MgCI 2 1 mM DTT and lOOuCi 32 P- 
labeled ATP. Reactions were terminated by blotting onto phosphocellulose paper and 
washing with 10% phosphoric acid. The % change in fluorescence represents the 
increase in fluorescence (475 nm excitation, 510 nm emission) observed in each purified 
protein resulting from incubation with excess protein kinase A catalytic subunit for 1 
hour at 30° C using the same phosphorylation conditions as described above except that 
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no 32 P-iabeled ATP was present and that after the reaction time was complete samples 
were analyzed in the fluorimeter rather than blotted onto phosphocellulose paper. 

The greatest changes in fluorescence occurred in mutant 214KRDSM (SEQ 
ID NO: 15) which exhibited a 40% change in fluorescence upon phosphorylation. 
5 However analysis of the kinetics of phosphorylation using 7- 32 P-labeIed ATP 

demonstrated that the site is poorly phosphorylated by protein kinase A. .Wild type GFP 
contains a mediocre consensus phosphorylation site (25HKFSV, from SEQ ID NO:l) that 
can be phosphorylated by protein kinase A in vitro with relatively slow kinetics. While 
phosphorylation at this position has no detectable effect on the fluorescence of GFP, the 
10 rate of phosphorylation at this position is used as an internal control between experiments 
to determine the relative rates of phosphorylation at sites engineered into the protein by 
site directed mutagenesis. 

B. Phosphorylation sites around the amino terminus 

15 Sites at the N-terminus of GFP were engineered into GFP by PCR. Initial 

studies attempted to preserve the native sequence as much as possible. As discussed 
earlier the positions chosen for phosphorylation were within the first 5 amino acids of 
GFP and encompassed all charged residues within this region. The sequence changes, 
relative fluorescence, relative rates of phosphorylation and % change in fluorescence 

20 upon phosphorylation are tabulated in Table III. 

Table III: Relative fluorescence, rate of phosphorylation and change in fluorescence 
upon phosphorylation for phosphorylation sites inserted at the N-terminus 

25 SEQ ID Sequence Relative fluorescence Relative rates of % Change in 



NO: as a % of wild type phosphorylation fluorescence 

2 1MSKGEELF 100 1.0 0 

25 1MRKGSCLF 40 5.1 5.7 

30 26 1MRKGSLLF 52 1.6 8.0 

27 1 MRRESLLF 30 3.0 6.0 

28 1MRRDSCLF 27 3.7 17 

29 1MSRRDSCF 43 2.1 25 

30 1MSKRRDSL 7 .5.5 5.1 
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Numbers prior to the sequence indicate amino acid position in wild type 
GFP where phosphorylation site starts. The relative rates of phosphorylation compare 
the rate of phosphorylation of the given phosphorylation site with the endogenous protein 
kinase A phosphorylation site in Aequorea GFP (HKFSV) measured by incorporation of 
5 32 P after incubation of the purified substrate and protein kinase A catalytic subunit in the 
presence of 32 P-labelled ATP using the standard protocols described earlier. The % 
change in fluorescence represents the change in fluorescence (488 run excitation, 511 nm 
emission) observed in each purified protein as a result of incubation with excess protein 
kinase A catalytic subunit for 1 hour at 30° C using phosphorylation conditions described 
10 earlier. 

These results demonstrated that mutants whose sequence closely resembles 
the native protein retain considerable fluorescence, display good kinetics of 
phosphorylation, but show relatively small changes in fluorescence after phosphorylation. 
To improve the effect of phosphorylation on fluorescence, amino acids around the 
15 phosphorylation site were mutated to create an optimal phosphorylation sequence even if 
it disordered the existing local tertiary structure. Such disruption was predicted and 
found to decrease the basal fluorescence of these constructs in their non-phosphorylated 
state (Table IV). 

20 Table IV: Relative fluorescence before phosphorylation and change in fluorescence 
upon phosphorylation for more drastically altered phosphorylation sites 
inserted at the N-terminus 



25 


SEQ ID 
NO: 


Sequence 


Relative fluorescence 
as a % of wild-type 


% Change in fluorescence 
upon phosphorylation 




2 


1MSKGEELF (=WT) 


= 100 


0 




31 


1MSRRRRSI 


5.8 


40 




32 


1MRRRRSII 


5.1 


70 


30 


33 


-1MRRRRS1II 


n.d. 


43 




34 


-2MRRRRSIIIF 


0.7 


15 




35 . 


-3MRRRRSIIIIF 


0.6 


70 



Numbers prior to the sequence indicate amino acid position in wild type GFP where 
35 phosphorylation site starts. Negative numbers indicate extensions onto the wild-type N- 
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terminus. The % change in fluorescence represents the change in fluorescence (488 
excitation, 511 emission) observed in each purified protein resulting from incubation with 
excess protein kinase A catalytic subunit for 1 hour at 30° C using standard 
phosphorylation conditions described earlier. 

5 Perhaps because of the reduced basal fluorescence, phosphorylation by 

protein kinase A produced greater percentage increases in fluorescence in these 
constructs than in the more conservative mutations of Table II. Constructs 1MRRRRSII 
(SEQ ID NO:32) and -3MRRRRSIIIIF (SEQ ID NO:35) displayed the greatest increases, 
about 70%, in fluorescence upon phosphorylation using the standard conditions, as 

10 shown in Fig. 6. However, these increased percentage increases were obtained at the 
cost of a reduced ability to fold at higher temperatures and relatively poor fluorescence 
even after phosphorylation, to improve these characteristics, these mutants were further 
optimized by additional random mutagenesis with a novel selection procedure. 

15 C. Further optimization of N-terminal phosphor ylation sites by random 

mutagenesis of the remainder of GFP 

The two best constructs from above (1MRRRRSII (SEQ ID NO:32) and - 
3MRRKRSIII IF (SEQ ID NO: 35)) were further mutagenized and screened for variants 
that are highly fluorescent when phosphorylated, but weakly fluorescent when non- 
20 phosphorylated. The method involved expression of a randomly mutated fluorescent 

substrate with or without simultaneous co-expression of the constitutively active catalytic 
subunit of protein kinase A in bacteria, and screening the individual mutants to determine 
those that are highly fluorescent in the presence but not the absence of the kinase. 

To enable co-expression of the kinase and potential substrates, a new 
25 expression vector with the kinase C subunit upstream from the fluorescent substrate was 
constructed (Fig. 7). Random mutations were introduced into GFP by error-prone PGR 
and the resulting population of mutants cloned into the co-expression vector using the 
appropriate restriction sites. The expression vector containing the mutated fluorescent 
substrates were transformed into host bacteria and individual bacterial colonies (each 
30 derived from a single cell, and hence containing a single unique mutant fluorescent 
substrate) were grown up. 



WO 98/02571 PCT/US97/12410 

36 

The colonies were screened for fluorescence either by fluorescence- 
activated cell sorting (Fig. 8) or by observation under a microscope. Those that 
exhibited the greatest fluorescence were re-screened under conditions in which the kinase 
gene was inactivated. This was achieved in either of two ways. In the first method the 
co-expression vector was isolated and treated with restriction endonucleases and 
modifying enzymes (EcoRl, klenow fragment and T4 DNA ligase) to cut the kinase 
gene, add additional bases and religate the DNA, causing a frame shift and hence 
inactivating the gene. The treated and non-treated plasmids were then re-transformed 
into bacteria and compared in fluorescence. Alternatively the plasmids were initially 
grown in a RecA* (recombinase A negative) bacterial strain, where the kinase is stable, 
to screen for brighter mutants in the presence of the kinase. The plasmid DNA was then 
isolated and re-transformed into a strain of bacteria which is RecA + , in which the kinase 
is unstable and is lost through homologous recombination of the tandomly repeated 
ribosome binding sites (rbs). The bacteria have a strong tendency to eliminate the kinase 
C subunit because it slows their multiplication, so cells that splice out the kinase by 
recombination have a large growth advantage. 

Comparison of the brightness of the mutant first in the presence of kinase 
then in its absence indicates the relative effect of phosphorylation on the mutant GFP 
fluorescence (after normalizing for GFP expression levels). A library of approximately 2 
xlO 6 members was screened by this approach. Approximately 500 displayed higher 
levels of fluorescence when screened in the presence of the kinase. After inactivation of 
the kinase, one mutant out of the 500 displayed reduced levels of fluorescence. The 
increased fluorescence of the remainder of the 500 mutants was independent of the 
presence of the kinase. This mutant GFP was isolated and sequenced and found to 
contain the following mutations compared to wild-type GFP (Fig. 3, SEQ ID NO:2) (in 
addition to the N-terminal phosphorylation site 1MRRRRSII (SEQ ID N0.32)): S65A, 
N149K, V163A and I167T. 

To confirm that this mutant was indeed directly sensitive to protein kinase 
A phosphorylation and to quantify its responsively, it was expressed in the absence of 
kinase. The E. colt were lysed and the protein purified as described earlier using a 
nickel affinity column. The protein exhibited high levels of fluorescence when induced 
at 30° C but displayed reduced fluorescence when incubated at 37 °C. After such 
preincubation (37° C overnight) and separation of the less fluorescent material by 
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centrifugation, this protein exhibited the largest change in fluorescence upon 
phosphorylation yet observed (Fig. 8). The tolerance of this mutant for 37 °C treatment 
suggests that this mutant is suitable for use in mammalian cells. 

The present invention provides novel assays for protein kinase activity 
involving novel fluorescent protein substrates. While specific examples have been 
provided, the above description is illustrative and not restrictive. Many variations of the 
invention will become apparent to those skilled in the art upon review of this 
specification. The scope of the invention should, therefore, be determined not with 
reference to the above description, but instead should be determined with reference to the 
appended claims along with their full scope of equivalents. 

All publications and patent documents cited in this application are 
incorporated by reference in their entirety for all purposes to the same extent as if each 
individual publication or patent document were so individually denoted. 
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WHAT IS CLAIMED IS : 

2 I. A method for determining whether a sample contains protein kinase 

3 activity comprising: 

4 contacting the sample with a phosphate donor and a fluorescent 

5 protein substrate for a protein kinase, the protein substrate comprising a fluorescent 

6 protein moiety and a phosphorylation site for a protein kinase, wherein the protein 

7 substrate exhibits a different fluorescent property in the phosphorylated state than in the 

8 un-phosphorylated state; 

9 exciting the protein substrate; and 

10 measuring the amount of a fluorescent property that differs in the 

1 1 un-phosphorylated state and phosphorylated state, whereby an amount that is consistent 

12 with the presence of the protein substrate in its phosphorylated state indicates the 

13 presence of protein kinase activity. 

1 2. The method of claim 1 for determining the amount of protein 

2 kinase activity in a sample wherein measuring the amount of a fluorescent property in the 

3 sample comprises measuring the amount at two or more time points after contacting the 

4 sample with a phosphate donor and a fluorescent protein substrate, and determining the 

5 quantity of change or rate of change of the measured amount, whereby the quantity or 

6 rate of change of the measured amount reflects the amount of protein kinase activity in 
the sample. 

1 3. The method of claim 2 wherein the fluorescent property is the 

2 fluorescent emission around the emission maximum of the substrate in the phosphorylated 
state. 

1 4. The method of claim 2 wherein amount is determined by emission 

2 amplitude ratioing or excitation amplitude ratioing. 

1 5. The method of claim 3 wherein the fluorescent protein is an 

2 Aequorea- related fluorescent protein. 
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1 6. A method for determining whether a cell exhibits protein kinase 

2 activity comprising the steps of: 

3 providing a transfected host cell comprising a recombinant nucleic 

4 acid molecule comprising expression control sequences operatively linked to a nucleic 

5 acid sequence coding for the expression of a fluorescent protein substrate for a protein 

6 kinase, the protein substrate comprising a fluorescent protein moiety containing a 

7 phosphorylation site for a protein kinase, wherein the protein substrate exhibits a 

8 different fluorescent property in the phosphorylated state than in the un-phosphorylated 

9 state, the cell expressing the fluorescent protein substrate; 

10 exciting the , protein substrate in the cell; and 

11 measuring the amount of a fluorescent property that differs in the 

12 un-phosphorylated and phosphorylated states, wherein the presence of the fluorescent 

13 property associated with the fluorescent state indicates the presence of protein kinase 

14 activity in the cell. 

1 7. The method of claim 6 wherein the fluorescent property is the 

2 fluorescent emission around the emission maximum of the substrate in the phosphorylated 
state. 

1 8, The method of claim 6 wherein the amount is determined by 

2 emission amplitude ratioing or excitation amplitude ratioing. 

1 9. The method of claim 6 wherein the cell is further transfected with 

2 an expression vector comprising expression control sequences operatively linked to a 

3 nucleic acid sequence coding for the expression of the protein kinase. 

1 10. The method of claim 6 wherein the fluorescent protein is an 

2 Aequorea-related fluorescent protein. 
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1 11. The method of claim 6 wherein the step of providing a transfected 

2 host cell comprises inducing expression of the protein substrate to produce a sudden 

3 increase in the expression of the protein substrate, and the step of measuring the amount 

4 of a fluorescent property comprises measuring the amount at a first and a second time 

5 after expression of the protein substrate and determining the difference between the 

6 measured amounts at the first and second time. 

1 12. A method for determining the amount of activity of a protein kinase 

2 in a sample from an organism comprising the steps of: 

3 providing a sample from an organism having a cell that expresses a 

4 fluorescent protein substrate for a protein kinase, the protein substrate comprising a 

5 fluorescent protein moiety and a phosphorylation site for a protein kinase, wherein the 

6 protein substrate exhibits a different fluorescent property in the phosphorylated state than 

7 in the un-phosphorylated state; 

8 contacting the sample with a phosphate donor; 

9 exciting the protein substrate; and 

10 measuring the amount of a fluorescent property that differs in the 

1 1 un-phosphorylated state and phosphorylated state, whereby an amount that is consistent 

12 with the presence of the protein substrate in its phosphorylated state indicates the 

13 presence of protein kinase activity, and an amount that is consistent with the presence of 

14 the protein substrate in its un-phosphorylated state indicates the absence of protein kinase 

15 activity. 

1 13. The method of claim 12 wherein the amount is determined by 

2 emission amplitude ratioing or excitation amplitude ratioing. 

1 14. The method of claim 12 wherein the fluorescent protein is an 

2 Aequorea-re\a.tcd fluorescent protein. 



1 



15. The method of claim 12 wherein the sample is a cell homogenate. 
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1 16. A method for determining whether a compound alters the activity 

2 of a protein kinase comprising the steps of: 

3 contacting a sample containing a known amount of protein kinase 

4 activity with the compound, a phosphate donor for the protein kinase and a fluorescent 

5 protein substrate for a protein kinase, the protein substrate comprising a fluorescent 

6 protein moiety and a phosphorylation site for a protein kinase, wherein the protein 

7 substrate exhibits a different fluorescent property in the phosphorylated state than in the 
. 8 un-phosphorylated state; 

9 exciting the protein substrate; 

10 measuring the amount of protein kinase activity in the sample as a 

11 function of the quantity of change or rate of change of a fluorescent property that differs 

12 in the un-phosphorylated and phosphorylated states; and 

13 comparing the amount of activity in the sample with a standard 

14 activity for the same amount of the protein kinase, whereby a difference between the 

15 amount of protein kinase activity in the sample and the standard activity indicates that the 

16 compound alters the activity of the protein kinase. 

! 17, The method of claim 16 wherein the amount is determined by 

2 emission amplitude ratioing or excitation amplitude ratioing. 

1 18. The method of claim 16 wherein the fluorescent protein is an 

2 Aequorea-rei&ied fluorescent protein. 

1 19. A method for determining whether a compound alters the protein 

2 kinase activity in a cell comprising the steps of: 

3 providing first and second transfected host cells exhibiting protein 

4 kinase activity and expressing a fluorescent protein substrate for a protein kinase, the 

5 protein substrate comprising a fluorescent protein moiety and a phosphorylation site for a 

6 protein kinase, wherein the protein substrate exhibits a different fluorescent property in 

7 the phosphorylated state than in the un-phosphorylated state; 

8 contacting the first cell with an amount of the compound; 
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9 contacting the second cell with a different amount of the compound; 

10 exciting the protein substrate in the first and second cells; 

1 1 measuring the amount of protein kinase activity in the cells as a 

12 function of the quantity of change or rate of change of a fluorescent property that differs 

13 in the un-phosphorylated and phosphorylated states in the first and second cells; and 
14 . comparing the amount in the first and second cells, whereby a 

15 difference in the amount indicates that the compound alters protein kinase activity in the 

16 cell. 

1 20, The method of claim 19 wherein the amount is determined by 

2 emission amplitude ratioing or excitation amplitude ratioing. 

1 21. The method of claim 19 wherein the fluorescent protein is an 

, 2 Aequorea-izXdXed. fluorescent protein. 

1 22. The method of claim 18 wherein the cells are transfected with an 

2 expression vector comprising expression control sequences operatively linked to a nucleic 

3 acid sequence coding for the expression of the protein kinase. 

1 23. A fluorescent protein substrate for a protein kinase comprising a 

2 fluorescent protein moiety and a phosphorylation site for a protein kinase, wherein the 

3 protein substrate exhibits a different fluorescent property in the phosphorylated state than 

4 in the un-phosphorylated state. 

1 24. The protein substrate of claim 23 wherein the fluorescent protein is 

2 an Aequorea-izlaxed. fluorescent protein. 

1 25. The protein substrate of claim 24 comprising a phosphorylation site 

2 for protein kinase A, a cGMP-dependent protein kinase, protein kinase C, 

3 Ca 2 "7calmodulin-dependent protein kinase I, Cf* / calmodulin-dependent protein kinase 

4 II or MAP kinase activated protein kinase type 1. 
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1 26. The protein substrate of claim 25 wherein the phosphorylation site 

2 is RRXSZ (SEQ ID NO:3) or RRXTZ (SEQ ID NO:4), wherein X is any amino acid 

3 and Z is a hydrophobic amino acid, BKISASEFDR PLR (SEQ ID NO: 5), where B 

4 represents either lysine (K) or arginine (R), and the first S is the site of phosphorylation, 

5 XRXXSXRX (SEQ ID NO:7), wherein X is any amino acid, KKKKRFSFK (SEQ ID 

6 NO:8), LRRLSDSNF (SEQ ID NO:9), KKLNRTLTVA (SEQ ID NO: 10), 

7 KKANRTLSVA (SEQ ID NO: 11). 

1 27. The protein substrate of claim 19 wherein the Aequorea-velaltd 

2 fluorescent protein is variant P4, P4-3. W7, W2, S65T, P4-1, S65A, S65L, Y66F or 

3 Y66W. . 

1 28. The protein substrate of claim 24 wherein the Aeguorea-relzted 

2 fluorescent protein comprises a folding mutation. 

j 29. The protein substrate of claim 24 comprising the phosphorylation 

2 site within twenty amino acids of the amino-terminus of the fluorescent protein moiety. 

1 30. The protein substrate of claim 23 wherein the phosphorylation site 

2 is contained within the sequence MRRRRSIITG (SEQ ID NO: 12) or MRRRRSII 1IFTG 

3 (SEQ ID NO: 13). 

j 31. The protein substrate of claim 30 further comprising the 

2 substitutions S65A, N149K, V163A and I167T. 



1 
2 



32. The protein substrate of claim 24 comprising the phosphorylation 
site within the fluorescent protein moiety more than twenty amino acids from the amino 
terminus. 
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1 33. The protein substrate of claim 32 comprising the substitution 

2 H217S. 

1 34. The protein substrate of claim 32 wherein the sequence of wild type 

2 Aequorea GFP is modified by the substitution 69RRFSA (SEQ ID NO: 14) or 

3 214KRDSM (SEQ ID NO: 15). 

1 35. The protein substrate of claim 32 further comprising the 

2 substitution E171K and/or 1 172V. 

1 36. The protein substrate of claim 24 wherein the protein substrate is a 

2 fusion protein. 

1 37. A nucleic acid molecule coding for the expression of a fluorescent 

2 protein substrate for a protein kinase comprising a fluorescent protein moiety and a 

3 phosphorylation site for a protein kinase, wherein the protein substrate exhibits a 

4 different fluorescent property in the phosphorylated state than in the un-phosphorylated 

5 state. 

1 38. The nucleic acid molecule of claim 37 wherein the fluorescent 

2 protein is an Aequorea-relzted fluorescent protein. 

1 39. The nucleic acid molecule of claim 37 wherein the protein substrate 

2 comprises a phosphorylation site for protein kinase A, a cGMP-dependent protein kinase, 

3 protein kinase C, Ca 2+ /calmodulin-dependent protein kinase I, Ca 2+ / calmodulin- 

4 dependent protein kinase II or MAP kinase activated protein kinase type 1 . 

1 40. The nucleic acid molecule of claim 37 wherein the phosphorylation 

2 site is RRXSZ (SEQ ID NO:3) or RRXTZ (SEQ ID NO:4), wherein X is any amino acid 

3 and 2 is a hydrophobic amino acid, BKISASEFDR PLR (SEQ ID NO:5), where B 

4 represents either lysine (K) or arginine (R), and the first S is the site of phosphorylation, 



WO 98/02571 PCT/US97/1 24 1 0 

45 

5 XRXXSXRX (SEQ ID NO: 7), wherein X is any amino acid, KKKKRFSFK (SEQ ID 

6 NO:8), LRRLSDSNF (SEQ ID NO:9), KKLNRTLTVA (SEQ ID NO: 10), 

7 KKANRTLSVA (SEQ ID NO: 1 1). 

1 41. The nucleic acid molecule of claim 37 wherein the Aequorea- 

2 related fluorescent protein is variant P4, P4-3, W7, W2, S65T. P4-I, S65A, S65L, 

3 Y66F or Y66W. 

.1 42. The nucleic acid molecule of claim 37 wherein the Aequorea- 

2 related fluorescent protein comprises a folding mutation. 

1 43. The nucleic acid molecule of claim 37 wherein the protein substrate 

2 comprises the phosphorylation site within twenty amino acids of the ami no-terminus of 

3 the fluorescent protein moiety. 

1 44. The nucleic acid molecule of claim 37 wherein the phosphorylation 

2 site is contained within the sequence MRRRRSIITG (SEQ ID NO: 12) or MRRRRSII 

3 IIFTG (SEQ ID NO: 13). 

1 45. The nucleic acid molecule of claim 37 wherein the protein substrate 

2 further comprises the substitutions S65A, NI49K, VI 63 A and I167T. 

1 46. The nucleic acid molecule of claim 37 wherein the protein substrate 

2 comprises the phosphorylation site within the fluorescent protein moiety more than 
twenty amino acids from the amino terminus. 



1 

2 



47. The nucleic acid molecule of claim 37 wherein the protein substrate 
comprises the substitution H217S. 



WO 98/02571 PCT/US97/12410 

46 

48. The nucleic acid molecule of claim 37 wherein the sequence of 
wild type Aequorea GFP is modified by the substitution 69RRFSA (SEQ ID NO: 14) or 
214KRDSM (SEQ ID NO: 15). 

49. The nucleic acid molecule of claim 37 wherein the protein substrate 
further comprises the substitution E171K and/or 1 172V. 



50. A recombinant nucleic acid molecule comprising expression cdntrol 
sequences operatively linked to a nucleic acid sequence coding . for the expression of a 
fluorescent protein substrate for a protein kinase comprising a fluorescent protein moiety 
and a phosphorylation site for a protein kinase, wherein the protein substrate exhibits a 
different fluorescent property in the phosphorylated state than in the un-phosphorylated 
state. 



51. A transfected host cell comprising a recombinant nucleic acid 
molecule comprising expression control sequences operatively linked to a nucleic acid 
sequence coding for the expression of a fluorescent protein substrate for a protein kinase 
comprising a fluorescent protein moiety and a phosphorylation site for a protein kinase, 
wherein the protein substrate exhibits a different fluorescent property in the 
phosphorylated state than in the un-phosphorylated state. 



52, A kit comprising a fluorescent protein substrate for a protein kinase 
comprising a fluorescent protein moiety and a phosphorylation site for a protein kinase, 
and comprising a phosphate donor. 

53. A collection of fluorescent protein candidate substrates comprising 
at least 10 different members, each member comprising a fluorescent protein moiety and 
a variable peptide moiety within about twenty amino acids of the amino -terminus of the 
fluorescent protein moiety. 
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54. The collection of claim 53 containing at least 10 3 different 

members. 



55. The collection of claim 54 containing at least 10 6 different 



members. 



56. A collection of recombinant nucleic acid molecules comprising at 
least 10 different recombinant nucleic acid molecule members, each member comprising 
expression control sequences operatively linked to nucleic acid sequences coding for the 
expression of a different fluorescent protein candidate substrate which comprises a 
fluorescent protein moiety and a variable peptide moiety within about twenty amino adds 
of the amino-terminus of the fluorescent protein moiety. 

57. A collection of host cells comprising at least 10 different host cell 
members, each member comprising a recombinant nucleic acid molecule which 
comprises expression control sequences operatively linked to nucleic acid sequences that 
code for the expression of a different fluorescent protein candidate substrate which 
comprises a fluorescent protein moiety and a variable peptide moiety within about twenty 
amino acids of the amino-terminus of the fluorescent protein moiety. 

58. A method for screening a collection of transfected host cells 

comprising: 

providing a collection of transfected host cells comprising at least 
10 different host cell members, each member expressing a different fluorescent protein 
candidate substrate, the candidate substrate comprising a fluorescent protein moiety and a 
variable peptide moiety within about twenty amino acids of the amino-terminus of the 
fluorescent protein moiety, wherein the fluorescent protein exhibits a fluorescent 
property; 

measuring the fluorescent property in a sample comprising at least 
one host cell member before and after increasing or decreasing the intracellular protein 
kinase activity in the cell; and 
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12 determining the degree of change or the rate of change in the 

13 fluorescent property upon increasing or decreasing intracellular protein kinase activity; 

14 whereby a change in the degree or rate of the fluorescent property 

15 indicates that the candidate substrate possesses a peptide moiety that can be 

16 phosphorylated by the protein kinase and whose phosphorylation alters the fluorescent 

17 property. 

1 59. The method of claim 58 whereby the terminus is the amino- 

2 terminus. 

3 60. The method of claim 59 further comprising determining the change 

4 in a plurality of host cell members and comparing the degree of change or rate of change 

5 between the host cell members; whereby the comparison indicates in the candidate 

6 substrates the relative change in the fluorescent property upon phosphorylation. 

1 61. The method of claim 60 further comprising isolating a member 

2 having the altered fluorescent property. 

1 62. The method of claim 59 wherein the step of determining the rate of 

2 change comprises measuring the fluorescent property at a plurality of time points after 

3 inducing intracellular protein kinase activity. 

1 63. The method of claim 59 wherein sample comprises a clonal 

2 expansion of the host cell. 

1 64. The method of claim 59 wherein the host cell is co-transfected with 

2 an expression vector that expresses a protein kinase comprising expression control 

3 sequences operatively linked to a sequence that codes for the expression of the protein 

4 kinase. 
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1 65. The method of claim 64 wherein the step of increasing the 

2 intracellular protein kinase activity comprises elevating intracellular cAMP to activate 

3 cAMP-dependent protein kinase, elevating intracellular cGMP to activate cGMP- 

4 dependent protein kinase, elevating cytosolic free calcium to activate Ca 2+ /calmodulin- 

5 dependent protein kinase types I, II, or IV, or administration of phorbol myristate acetate 

6 to activate protein kinase C. 

1 66. The method of claim 59 wherein the step of decreasing the - 

2 intracellular protein kinase activity comprises culturing the cell for a time sufficient for 

3 the cell to lose sequence that codes for the expression of the protein kinase. 

1 67. The method of claim 59 further comprising isolating the 

2 recombinant nucleic acid molecule coding for the expression of the candidate substrate. 

1 68. A method for screening a collection of transfected host cells 

2 comprising: 

3 providing a lysate from each of at least one member of a collection 

4 of transfected host cells comprising at least 10 different host cell members, each member 

5 expressing a different fluorescent protein candidate substrate, the candidate substrate 

6 comprising a fluorescent protein moiety and a variable peptide moiety around the amino- 

7 terminus of the fluorescent protein moiety, wherein the. fluorescent protein exhibits a 

8 fluorescent property; 

9 contacting the lysate with a phosphate donor; 

10 measuring the fluorescent property from at least one lysate before 

1 1 and after contacting the lysate with a protein kinase; and 

12 determining the degree of change or the rate of change in the 

13 fluorescent property upon contacting the lysate with intracellular protein kinase activity; 

14 whereby a change in the degree or rate of the fluorescent property 

15 indicates that the candidate substrate possesses a peptide moiety that can be 

16 . phosphorylated by the protein kinase and whose phosphorylation alters the fluorescent 

17 property. 
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1 69. The method of claim 68 comprising splitting the lysate into first 

2 and second aliquots; 

3 contacting the first and second aliquot with a phosphate donor; 

4 contacting the first aliquot with a protein kinase but not contacting 

5 the second aliquot with the protein kinase; 

6 measuring the fluorescent property from the first aliquot before and 

7 after contacting the lysate with the protein kinase; 

8 - measuring the fluorescent property from the second aliquot at a 

9 plurality of time points after contact with the phosphate donor; 

10 determining the degree of change or the rate of change in the 

1 1 fluorescent property of the first and second aliquots; 

12 comparing the degree or rate of change in the first and second 

13 aliquots; 

14 whereby a difference in the degree or rate of change in the 

15 fluorescent property in first and second aliquots indicates that the candidate substrate 

16 possesses a peptide moiety that alters the fluorescent property. 

1 70. The method of claim 68, wherein the amount is determined by 
emission amplitude ratioing or excitation amplitude ratioing, 

1 .71. The method of claim 69 wherein the step of determining the rate of 

2 change comprises measuring the fluorescent property at a plurality of time points after 

3 inducing intracellular protein kinase activity. 

1 72. The method of claim 69 wherein the host cell is expanded into a 

2 clonal population. 

1 73. The method of claim 69 further comprising isolating a member of 

2 the library having an altered fluorescent property. 
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74. The method of claim 72 further comprising isolating the 
recombinant nucleic acid molecule coding for the expression of the candidate substrate. 

75. A method of introducing a phosphorylation site into a fluorescent 
protein comprising the step of genetically attaching an amino acid sequence including a 
phosphorylation site within twenty amino acids of a terminus of a fluorescent protein 
moiety. 



terminus. 



76. 



The method of claim 75 wherein the terminus is the amino- 
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Cxi) SEQUENCE 0ESC8 IPTIOHl 

SEQ ID NO: 1 : atg act aaa cca caa caa ctt ttc act cca ctt ctc cca att ctt ctt 
Z'Q ID NO- 2* Ser Lys Cly CLu Clu Leu Phe Thr Cly Val Vil Pro I la Leu Val 

1.5 10 15 

CAA TTA CAT CCT CAT CTT AAT CCC CAE AAA TTT TCT CTC ACT CCA CAC 
Clu Leu Asp Cly Asp Val Atn Cty Hii Lyi Phe Ser Val Ser Cly Clu 
20 25 30 

CCT CAA CCT CAT CCA ACA TAC CCA AAA CTT ACC CTT AAA TTT ATT TCC 
Cty Ctu Cty Asp Ala Thr Tyr Cly Lyf Leu Thr Leu LyS Phe lie Cys 
35 AO '5 

ACT ACT "CCA AAA CTA CCT CTT CCA TCC CCA AC A CTT CTC ACT ACT TTC 

Thr Thr Cly Lyf Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Phe 
50 55 60 

TCT TAT CCT CTT CAA TCC TTT TCA ACA TAC CCA CAT CAT ATC AAA CCC 
Ser lyr Cly Val Cln Cys Phe Ser Arg fyr Pro Asp wis Het Lys Arg 
AS 70 73 BO 

CAT CAC TTT TTC AAC ACT CCC ATC CCC CAA CCT TAT CTA CAC CAA ACA 
His Asp Phe Pne Lys Ser Ale Met Pro Ctu Cly Tyr Vat Cln Clu Arg 
AS 90 °5 

ACT ATA TTT TTC AAA CAT CAC CCC AAC TAC AAC ACA CCT CCT CAA CTC 
Thr lie Pne Phe Lys Asp Asp Cly Ain Tyr Lys Thr Arg Ala Clu Val 
100 105 110 

AAC TTT CAA CCT CAT ACC CTT CTT AAT ACA ATC CAC TTA AAA CCT ATT 
Lys Phe Clu Cly Asp Thr Leu Val Asn Arg He Ctu Leu Lys Cly lie 
115 120 125 

CAT TTT AAA CAA CAT CCA AAC ATT CTT CCA CAC AAA TTC CAA TAC AAC 
Asp Phe Lys Clu Asp Cly Asn lie Leu Cly His Lys Leu Clu Tyr Asn 
130 135 UO 

TAT AAC TCA CAC AAT CTA TAC ATC ATC CCA CAC AAA CAA AAC AAT CCA 
Tyr Asn Ser Mis Asn Val Tyr tie net Ala asp Lys Cln Lys a in Cly 
145 150 155 l&O 

ATC AAA CTT AAC TTC AAA ATT ACA CAC AAC ATT CAA CAT CCA ACC CTT 
lie Lys Val Asn Phe Lyi He Arg Kit Asn lie Clu Asp Cly Ser Val 
165 170 175 

CAA CTA CCA CAC CAT TAT CAA CAA AAT ACT CCA ATT CCC CAT CCC CCT 
Cln Leu Ala Asp His Tyr Cln Cln Atn Thr Pro He Cly Asp Cly Pro 
1fl0 IBS 190 

CTC CTT TTA CCA CAC AAC CAT TAC CTC TCC ACA CAA TCT CCC CTT TCC 
Vat Leu L«u Pro Asp Asn His Tyr Leu Ser Thr Cln Ser Ala Leu Ser 
195 200 20S 

AAA CAT CCC AAC CAA AAC ACA CAC CAC ATC CTC CTT CTT CAC TTT CTA 
Lys Asp Pro Asn Clu Lys Arg Asp His Ket Val Leu Leu Clu Phe Val 
210 215 220 

ACA CCT CCT CCC *ATT ACA CAT CCC ATC CAT CAA CTA TAC AAA TA 
Thr Ala Ata Cly He Thr His Cly her Asp Clu Leu Tyr Lys 
225 230 235 



FIGURE 3 
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Figure S. UxitUoni of phosphorylation sites distal to the N-tizmnus . 
abIso &cltli underlined represent the phosphorylation moti£, a mino acids 
in brackets rtpxaitnt wild type sequence at those positions. 



ATG ACT AAA GCA GAA GAA CTT TTC ACT GGA CTT CTC CCA ATT CTT GTT GAA TTA GAT GGT 

Hat Ser Lys Gly Clu Clu Lou Phe Thr Gly val Val Pro II© Leu Val Clu Lou Asp Gly 
BO HX) 120 
(His Lys) * * 

GAT GTT AAT GGG ACA AGA TTT TCT CTC ACT GGA GAG GGT GAA GGT CAT GCA ACA TAG GCA 

Asp Val Asn Gly *™ Are ser Val Sex Gly Glu Gly Gla Gly Asp Ala Thr Tyr Gly 
140 ISO 1B0 

* * * 

AAA CTT ACC CTT AAA TTT ATT TGC ACT ACT GGA AAA CTA CUT GTT CCA TGG CCA ACA CTT 

Lys Leu Thr Lou Lya Phe lie Cya Thr Thr Gly Lya Leu Pro Val Pro Trp Pro "Thx Leo 

200 220 240 

* (Gin Cya ) (Argj* * 
CTC ACT ACT TTC TCT TAT GGT GTT AGA AGA TTT TCA GCA TAC CCA CAT CAT ATG AAA CAG 
val Thr Thr Phe 5er Tyr Gly Val trn ftr ? Ph P *~<r Tyr Pro Asp His Met Lya Gin 

260 290 300 

* * (Glu) _ _ * ■ 

CAT GAC TTT TTC AAG ACT GCC ATG CCC GAA GGT TAT GTA CAG AGA AGA TCT" ATA TTT TTC 

Hia Asp Phe- Phe Lya Sex Ala Met Pro Glu Gly Tyr Val Gin »rg KTQ ftp** Tie Phe Phe 

320 340 360 

AAA GAT GAC GGG AAC TAC AAG ACA CCT CCT GAA CTC AAG TTT GAA GGT GAT ACC CTT GTT 
Lya Asp Asp Gly Aan. Tyr Lya Thr Arg Ala Clu Val Lya Phe Glu Gly Aap Thr Leu val 
390 400 

* (Glu Aap)* (Asn) * 

AAT AGA ATC CAG TTA AAA GGT ATT GAT TTT AAA AGA AGA GGA AAC ATT CTT CCA CAC AAA 
Aan Arg He Glu Leu Lya Gly lie Asp Phe Lya *™ >™ m y Tin Leu Gly Bia Lya 

440 460 480 

* « (Gla) (Afl0> * 
TTG GAA TAC AAC TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA AGA AAG TCT GGA 
Leu Glu Tyr Atn Tyr Aan Ser His Asn Val Tyr lie Met Ala Aap Tvn *XP TYT firr lilV 

500 522 ^ 

* (Glu Asp)* 

ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT AGA AGA GGA AGC GTT CAA CTA GCA GAC 

He Lya Val Asn Phe Lya He Arg His Asn lie a™ Am filY g *" r VaI CLn Lau 

560 580 600 

. . (Bis Tyr) 

CAT TAT CAA CAA AAT ACT CCA ATT GGC GAT CGC CCT CTC CTT TTA CCA GAC AAC AGA AGA 

His Tyr Gin Gin Asn Thr Pro He Gly Aap Gly Pro Val Lau Leu Pro Asp Asn ft hi atb 

G2D 640 660 



(Thr) 



(HIS) 



CTC TCC ATA CAA TCT GCC CTT TCG AAA GAT CCC AAC GAA AAG AGA GAC Leu 
t.mr, ser He Gin Ser Ala Leu Ser Lya Aap Pro Asn Glu T i V^ Urn MP nfT nrr* vaJ - 
£80 700^ 

CTT GAG TTT GTA ACA CCT CCT GGG ATT ACA CAT GGC ATG GAT Tvr LVS '•••^ 

Leu Glu Phe val Thr Ala Ala Gly He Thr Hia Gly Ket Asp Glu Leu Tyr Lys 



4/12 



WO 98/02571 PCI7US97/12410 

GFP Bacteria Expression Cassette 

The BamHl fragment of GFP in Biuescript 1 1 was cloned into the BamHI- 
site of pRSETg (from tnvitrogen) 




M«l fraftond Bl.nd1.a9 Bouia Whml 

GGACATATACAT ATC CCC OCT TCT CAT CAT CAT CAT CAT CAT CCT ATC CCT ACC ATC ACT CCT CCA CAG 
K*t At q Cly sar Hi» «i« Uii His 8is mi Cly H«t Al» »«r M«t Thr cly Cly Cln 



EX Cl*»*«f« c*«»i off 

CAA ATC CCT CCC CAT CTC TAC GAC CAT CAC CAT AAC CAT CCC CCC CCT CAA TTC ATC ACT TAC 

Gin nit Cly Ar? Mp l_«u Tyr K*p A«p A»p A»p lym A»p Pro Pro Ala Clu Ph» M«* «« *T r 

KiTtMl I*cl Xttol MUX »■« P»»ll »™l liUl tlftdlll 

AAA TAA TAX GCATCCGACCTCGAGATCTCCACCTOGTACCATGGAATTCGAAGCT*rCA 
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Dual PKA Cat /gfp expression canettc 



Zte 1 Kpn-1 ira g n an t of PKA car subcloned into pRSET b into 
blunt 



Zba 1 site 



as *c ft. ft. m Ma 




sea l 

■t«al »*■»•*« almtfi&g omli Max 

OCAS AT AX AC AT AI3 COS GST TCT CAT CAT CAT CAT CAT CAT GOT A TO OCT AQC ATO ACT DOT OOA CAG 
M«t atq Oir &*r Bla Hi« Bla ui an His air *•* Ala B»r not Tnr Oiy 017 Oln 

XE aai^t MXtm ft**U. tea Z 3Qm>3. ftel It »»t I P<M S 

CAA AT9 OCT CGO OAT CTO TAC GAC OAT OAC OAT A AO OAT CCO AGC TCO A OA TCT OCA OCT 



KVt-1 O0 ■fit, ttt&d ZZZ 

GOT ACC ATS ASA ASA A OA ASA TCA AAA TAA AAOCTTOA 
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Box II Observations where unity of invention i* lacking (Continuation of item 1 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
Please See Extra Sheet. 



t . | j Aa all required addition* J search fees were timely paid by the applicant, this international search report coven all searchable 
claims. 

2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 

3. ( x( Aa only some of the required additional search fcea were timely paid by the applicant, this international search report covers 
only those claims for which fees were paid, specifically claims Nos.: 

1-52, 75. 76 



4. j' " [ No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims: it is covered by claims Nos.: 



Remark on Protest | ] The additional search fcea were accompanied by the applicant's protest. 

j | No protest accompanied the payment of additional search fee*. 

Form PCT/1SA/210 (continuation of first ihcct(l»(July 19921* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/I2410 



A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

C12Q IMS. 1/6*: C07K 19/00; C12N 15/62. 15/11, 1/21, 15/10. 15/09 

B. FIELDS SEARCHED 

Electronic data base* consulted (Name of data base and where practicable term* uaed): 

APS. MEDLINE, SCISEARCH. LIFESCI. BIOTECHDS. BIOS1S. EMBASE. CAS. NTIS. WPI 

•earco terms: proteun kinase*. uuyV or meaaur?. phoaphoryUt?. site* or domain* or region*, fusion or insert?, 

fluoreacen?, green fluorescent p roic in* or gfp 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple invention a as* follows: ' 

Thii application containj the following inventions or group, of invention! which are not to linked as to form a single 
inventive concept under PCT Rule 13. 1. In order for all inventions to. be searched, the appropriate additional search 
fees must be paid. 

Group I. claims 1-36. 52. 75. and 76. drawn to a fluorescent protein substrate for a protein kinase, method of making 
laid substrate and method of use of said substrate. 

Group II. claims 37-51. drawn to nucleic aeids encoding a fluorescent protein substrate for a protein kinase. 
Group III, claims 33-55. drawn to peptide library. 

Group IV, claims 56-74. drawn to » gene library and method of uae thereof. 

™ bT*^ ^ " GmUP * MV d ° 001 reUle 10 * ,in e ,c inventive concept under PCT Rule 13.1 because, under 
7' , T' 7? y Uck 1AmC OT corresponding special technical features for the following reasons: The proteins 
l\ £™ P DNA ° f Crou P 11 comprue unrelated chemical structures. The protein, of Group I and peptide library 

vLZZ , r ™ ' lCChmCal fcAmPC " P"^^ of Gro "P 1 *«= not quired to be a pan of the peptide 

Z5Z °.l.H ro T^ ^ OUCiCiC * Cidj ° f Gn>Up 11 ^ Ubr% *y ° f Group IV do not share a technical feature a. the 
^ I UP , ™ »° be a part of the peptide library of Group IV. Accordingly, the claims are 

not so linked by a special technical feature within the meaning of PCT Rule 13.2 so as to form a single inventive 
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